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Vlll SUGGESTIONS FOR USE OF BOOK 

and IV. The problems of Chapter IV are investigated for the groups 
introduced in Chapter III. 

Any one of these chapters contains too much material for an 
advanced undergraduate course or seminar. I could make the fol- 
lowing suggestions for the content of such courses. 

1) The easier parts of Chapter II. 

2) The linear algebra of the first five paragraphs of Chapter I 
followed by selected topics from Chapter III, either on orthogonal 
or on symplectic geometry. 

3) The fundamental theorem of projective geometry, followed by 
some parts of Chapter IV. 

4) Chapter III, but with the following modification: 
All that is needed from 4 of Chapter I is the statement : 

If W* is the space orthogonal to a subspace W of a non-singular 
space V then dim W + dim W* = dim V. This statement could 
be obtained from the naive theory of linear equations and the in- 
structor could supply a proof of it. Our statement implies then 
TF** = W and no further reference to 4 of Chapter I is needed. 
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CHAPTER 1 
Preliminary Notions 

1. Notions of set theory 

We begin with a list of the customary symbols: 

a e S means a is an element of the set S. 

S C T means S is a subset of T. 

S C\ T means the intersection of the sets S and T; should 

it be empty we call the sets disjoint. 
S U T stands for the union of S and T. 

C\i Si and \Ji 8^ stand for intersection and union of a family of 
indexed sets. Should S t - and S, be disjoint for i ?& j we call W, S, a 
disjoint union of sets. Sets are sometimes defined by a symbol { } 
where the elements are enumerated between the parenthesis or by a 
symbol {x\ A } where A is a property required of x; this symbol is read: 
"the set of all x with the property A". Thus, for example: 

S r\ T = {x\xe S,xtT}. 

If / is a map of a non-empty set S into a set jP, i.e., a function f(s) 
defined for all elements s e S with values in T, then we write either 

/ : S -> T or S A T. 

If S -A T and T -A U we also write S-^T-^U.IfssS then we 
can form g(f(s)) e U and thus obtain a map from S to U denoted by 
S ^ U. Notice that the associative law holds trivially for these 
"products" of maps. The order of the two factors gf comes from 
the notation /($) for the image of the elements. Had we written (s)f 
instead of /(s), it would have been natural to write fg instead of gf. 
Although we will stick (with rare exceptions) to the notation f(s) 
the reader should be able to do everything in the reversed notation. 
Sometimes it is even convenient to write s f instead of /($) and we 
should notice that in this notation ($')" = s' f . 
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If S -A- T and S C S then the set of all images of elements of S 
is denoted by /( ); it is called the image of S {} . This can be done 
particularly for S itself. Then f(S) C T; should f(S) = T we call 
the map onto and say that / maps S onto T. 

Let T be a subset of T. The set of all s t S for which J(s) t T is 
called the inverse image of T and is denoted by f~ l (T ). Notice 
that f~ l (T ) may very well be empty, even if T Q is not empty. 
Remember also that /"* is not a map. By f~ l (f) for a certain / e T 
we mean the inverse image of the set {/} with the one element /. It 
may happen that /~ l (0 never contains more than one clement. 
Then we say that / is a one-to-one into map. If / is onto and one-to- 
one into, then we say that / is one-to-one onto, or a "one-to-one cor- 
respondence." In this case only can f~ l be interpreted as a map 
T '-^i S and is also one-to-one onto. Notice that /""*/ : S > S and 
ff~ l : T > T and that both maps are identity maps on S respectively 
T. 

If /, j t 2 are elements of T, then the sets f~ l (ti) and f" l (t 2 ) are 
disjoint. If s is a given element of S and /(s) = /, then s will be in 
/~ l (0> which shows that S is the disjoint union of all the sets f~ l (t): 



Some of the sets /"*(/) may be empty. Keep only the non-empty 
ones and call S f the set whose elements are these non-empty sets 
f~ l (). Notice that the elements of S f are sets and not elements of *S. 
S f is called a quotient set and its elements are also called equivalence 
classes. Thus, Si and s 2 are in the same equivalence class if and only 
if f(si) = /(s 2 )- Any given clement s lies in precisely one equivalence 
class; if /(s) = /, then the equivalence class of s is f~ l (f). 

We construct now a map /, : S * 8, by mapping each 5 e S onto 
its equivalence class. Thus, if /(s) = /, then /i(s) = J~ l (t). This 
map is an onto map. 

Next we construct a map / 2 : S f > f(S) by mapping the non- 
empty equivalence class f~ l (f) onto the element / e f(S). If / e f(S), 
hence / = /(s), then / is the image of the equivalence class f~ l (f) 
and of no other. This map / 2 is therefore one-to-one and onto. If 
s e S and /() = /, then /,(s) = f~ l (t) and the image of /"*(/) under 
the map / 2 is t. Therefore, / 2 /i(s) = /. 

Finally we construct a very trivial map / 3 : /(/S) > T by setting 
/ 3 (/) = t f or / E f(S). This map should not be called identity since 
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it is a map of a subset into a possibly bigger set T. A map of this 
kind is called an injection and is of course one-to-one into. For 
/(s) = / we had / 2 /i(s) = t and thus / 3 / 2 /i(s) = /. We have 
S^S, f -> f(S) ^> T, so that / 3 / 2 /i : S -> T. We see that our original 
map / is factored into three maps 



To repeat: /i is onto, / 2 is a one-to-one correspondence and / 3 is 
one-to-one into. We will call this the canonical factoring of the map /. 
The word "canonical" or also "natural" is applied in a rather loose 
sense to any mathematical construction which is unique in as much 
as no free choices of objects are used in it. 

As an example, let G and H be groups, and / : G > // a homo- 
morphism of G into H, i.e., a map for which f(xy) = f(x)f(y) holds 
for all x, y c G. Setting x = y = 1 (unit of G) we obtain /(I) = 1 
(unit in H). Putting y = of 1 , we obtain next f(x~ l ) = (f(x))~ l . We 
will now describe the canonical factoring of / and must to this effect 
first find the quotient set G r . The elements x and y are in the same 
equivalence class if and only if f(x) = f(y) or f(xy~ l ) = 1 or also 
f(y~ l x) 1; denoting by K the inverse image of 1 this means that 
both xy~ l e K and y~ l xzK (or x e Ky and x e yK). The two cosets yK 
and Ky are therefore the same and the elements x which are equiva- 
lent to y form the coset yK. If we take y already in K, hence y in 
the equivalence class of 1 we obtain yK = K, so that K is a group. 
The equality of left and right cosets implies that K is an invariant 
subgroup and our quotient set merely the factor group G/K. The 
map /i associates with each a: e G the coset xK as image: f^(x) = xK. 
The point now is that /i is a homomorphism (onto). Indeed fi(xy) = 
xyK - xyK-K = x-Ky-K = xK-yK = f^f^y). 

This map is called the canonical homomorphism of a group onto 
its factor group. 

The map / 2 maps xK onto f(x) : f z (xK) = f(x). Since f. 2 (xK-yK) = 
f 2 (xy-K) = f(xy) = f(x)f(y) = f z (xK)f 2 (yK) it is a homomorphism. 
Since it is a one-to-one correspondence it is an isomorphism and 
yields the statement that the factor group G/K is isomorphic to the 
image group f(G). The invariant subgroup K of G is called the kernel 
of the map /. 

The map / 3 is just an injection and therefore an isomorphism 
into H. 
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2. Theorems on vector spaces 

We shall assume that the reader is familiar with the notion and 
the most elementary properties of a vector space but shall repeat 
its definition and discuss some aspects with which he may not 
have come into contact. 

DEFINITION 1.1. A right vector space V over a field k (k need 
not be a commutative field) is an additive group together with a 
composition Aa of an element A t V and an element a e k such that 
Aa E V and such that the following rules hold: 

1) (A + E)a = Aa + Ba, 2) A(a + 6) = Aa + Ab, 
3) (Aa)b = A(ab), 4) A-l = A, 

where A, B e V y a, b t k and where 1 is the unit element of k. 

In case of a left vector space the composition is written aA and 
similar laws are supposed to hold. 

Let V be a right vector space over k and S an arbitrary subset of V. 
By a linear combination of elements of S one means a finite sum 
A&i + A 2 a 2 + + A r a r of elements A % of S. It is easy to see that 
the set (S) of all linear combinations of elements of S forms a subspace 
of V and that (S) is the smallest subspace of V which contains S. 
If S is the empty set we mean by (S) the smallest subspace of V 
which contains S and, since is in any subspace, the space (S) 
consists of the zero vector alone. This subspace is also denoted by 0. 

We call (S) the space generated (or spanned) by S and say that 

5 is a system of generators of (S). 

A subset S is called independent if a linear combination 
A^i + A 2 a 2 + + A r a r of distinct elements of S is the zero 
vector only in the case when all a, = 0. The empty set is therefore 
independent. 

If S is independent and (S) = V then S is called a basis of V. 
This means that every vector of V is a linear combination of distinct 
elements of S and that such an expression is unique up to trivial 
terms A-0. 

If T is independent and L is any system of generators of V then 
T can be "completed" to a basis of V by elements of L. This means 
that there exists a subset L Q of L which is disjoint from T such that 
the set T VJ L is a basis of V. The reader certainly knows this 
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statement, at least when V is finite dimensional. The proof for the 
infinite dimensional case necessitates a transfinite axiom such as 
Zorn's lemma but a reader who is not familiar with it may restrict 
all the following considerations to the finite dimensional case. 

If V has as basis a finite set S, then the number n of elements of S 
(n = if S is empty) depends only on V and is called the dimension 
of V. We write n = dim V. This number n is then the maximal 
number of independent elements of V and any independent set T 
with n elements is a basis of V. If U is a subspace of F, then dim U < 
dim V and the equal sign holds only for U = V. 

The fact that V does not have such a finite basis is denoted by 
writing dim V = o . A proper subspace U of V may then still have 
the dimension < . (One could introduce a more refined definition of 
dim F, namely the cardinality of a basis. We shall not use it, however, 
and warn the reader that certain statements we are going to make 
would not be true with this refined definition of dimension.) 

The simplest example of an n-dimensional space is the set of all 
n-tuples of elements of k with the following definitions for sum and 
product: 

(xi , x 2 , , x n ) + (*/i , 2/2 , , y n ) = (xi + 2/1 , , x n + */,), 

, x 2 a, , x n a). 



If U and W are subspaces of V (an arbitrary space), then the space 
spanned by U U W is denoted by U + W. Since a linear combination 
of elements of U is again an element of U we see that U + W con- 
sists of all vectors of the form A + B where A e U and B e W. The 
two spaces U and W may be of such a nature that an element U + W 
is uniquely expressed in the form A + B with A t U, B t W. One 
sees that this is the case if and only if U r\ W = 0. We say then 
that the sum U + W is direct and use the symbol U W. Thus 
one can write U W for U + W if and only if U H W = 0. 

If Ui , U 2 , C/3 are subspaces and if we can write (U v C7 2 ) U z , 
then an expression A l + A 2 + A 3 with A,- E 7, is unique and thus 
one can also write U l (C7 2 C/ 3 ). We may therefore leave out 
the parenthesis: U l C7 2 C7 3 . An intersection of subspaces is 
always a subspace. 

Let U now be a subspace of V. We remember that V was an additive 
group. This allows us to consider the additive factor group V/U 
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whose elements are the cosets A + U. (A + V for an arbitrary but 
fixed A e V means the set of all vectors of the form A + B, B E U.) 
Equality A^ + U = A 2 + U of two cosets means A l A 2 t U, 
addition is explained by (A 1 + U) + (A 2 + U) = (A l + A 2 ) + U. 
We also have the canonical map 

<p : V - V/U 

which maps A e V onto the coset A + U containing A. The map <p 
is an additive homomorphism of V onto V/U. We make V/U into 
a vector space by defining tho composition of an element A + U of 
V/U and an element a t k by: 

(A + E7)-a = Aa+ U. 

One has first to show that this composition is well defined, i.e., 
does not depend on the particular element A of the coset A + U. 
But if A + U = B + U, then A - B e U, hence (A - B)a t U which 
shows Aa + U = Ba + U. That the formal laws of Definition 1.1 
are satisfied is pretty obvious. For the canonical map <p we have 

<f>(Aa) = Aa + U = (A + U)-a = <p(A)-a 

in addition to the fact that <p is an additive homomorphism. This 
suggests 

DEFINITION 1.2. Let V and W be two right vector spaces (W 
not necessarily a subspace of V) over fc. A map / : V > W is called 
a homomorphism of V into W if 

1) /(A + B) = /(A) + /(B), A e V and Be 7, 

2) /(Aa) = /(A) -a, A e 7 and a e *. 

Should / be a one-to-ono correspondence, we call / an isomorphism 
of V onto W and we denote the mere existence of such an isomorphism 
by V ~ W (read: "V isomorphic to W"). 

Notice that such a homomorphism is certainly a homomorphism 
of the additive group. The notion of kernel U of / is therefore already 
defined, U = /"'(O), the set of all A e 7 for which /(A) = 0. If 
A c U then /(A a) = /(A) -a = so that Aa e C7. This shows that 
U is not only a subgroup but even a subspace of 7. 

Let U be an arbitrary subspace of 7 and ^ : 7 7/ U the canonical 
map. Then it is clear that <p is a homomorphism of 7 onto V/U. 
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The zero element of V/U is the image of 0, hence U itself. The kernel 
consists of all A e V for which 

v(A) = A + U = U. 

It is therefore the given subspace U. One should mention the special 
case U = 0. Each coset A + U is now the set with the single element 
A and may be identified with A. Strictly speaking we have only a 
canonical isomorphism F/0 ~ V but we shall write F/0 = V. 

Let us return to any homomorphism / : V > W and let U be the 
kernel of F. Since / is a homomorphism of the additive groups we 
have already the canonical splitting 

V l> V/U /(F) ^> W 

where /i(A) = A + U is the canonical map V > V/U, where 
f 2 (A + U) = f(A) and, therefore, 

M(A + C7)a) = f,(Aa + U) = f(Ad) = /(A)o = /,(A + t/)a 

and where / 3 is the injection. All three maps are consequently homo- 
morphisms between the vector spaces, and / 2 is an isomorphism 
onto. We have, therefore, 

THEOREM 1.1. To a given homomorphism f :V * W with kernel 
U we can construct a canonical isomorphism / 2 mapping V/U onto 
the image space f(V). 

Suppose now that U and W are given subspaces of V. Let <p be 
the canonical map V -** V/U. The restriction ^ of <p to the given 
subspace TF is a canonically constructed homomorphism W -* V/U. 
What is iKJF)? It consists of all cosets A + U with A t W. The 
union of these cosets forms the space W + U, the cosets A + U 
are, therefore, the stratification of W + U by cosets of the subspace 
U of W + U. This shows f(W) = (U + TP)/C7. What is the kernel 
of ^? For all elements A t W we have \f/(A) = <p(A). But v? has, in 
V, the kernel U so that ^ has U H TF as kernel. To ^ we can construct 
the canonical map ^ 2 which exhibits the isomorphism of W/(U C\ W) 
with the image (U + W)/U. Since everything was canonical we have 

THEOREM 1.2. If U and W are subspaces of V then (U + W)/U 
and W/(U C\ W] are canonically isomorphic. 

In the special case V = U W we find that V/U and W/(U n TF) 
= W/0 = TF are canonically isomorphic. Suppose now that 
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only the subspace U of V is given. Does there exist a subspace W 
such that V = U W? Such a subspace shall be called supple- 
mentary to U. Let S be a basis of U and complete S to a basis S VJ T 
of V where S and T are disjoint. Put T7 = (T), then f7 + W = 7 
and obviously V = 7 TT. This construction involves choices and 
is far from being canonical. 

THEOREM 1.3. To every subspace U of V one can find (in a non- 
canonical way) supplementary spaces W for which V = U W. 
Each of these supplementary subspaces W is, however, canonically iso- 
morphic to the space V/U.IfV = U W l = U W 2 then W, is 
canonically isomorphic to W 2 . 

If / : V W is an isomorphism into then the image f(S) of a 
basis of V will at least be independent. One concludes the inequality 
dim V < dim W. Should / be also onto then equality holds. 

In our construction of W we also saw that dim V = dim U + 
dim W and since W ~ V/U one obtains 

dim V = dim U + dim F/C7 
hence also, whenever V = / W, that 

dim 7 = dim 17 + dim F. 

Let now t/i C C/ 2 C C^ be subspaces of V. Find subspaces T7 2 
and TF 3 such that 



and, therefore, 



We have dim U 2 /Ui = dim W 2 , dim r/ 3 /C7 2 = dim W 3 and 
dim U 3 /U, = dim(TT 2 TF 3 ) = dim W 2 + dim W 3 . Thus we have 
proved: if U l C ^2 C *7 3 , then 

(1.1) dim t/a/f/! = dim ^/EA + dim U 3 /U 2 . 

Let now U and W be two given subspaces of V. Use (1.1) for 
U, = 0, C7 2 = C7, C/3 = U + W. We obtain 

dim(C7 + W) = dim U + dim(C7 + W)/U 
= dim C7 + dim TF/(C7 H W). 
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If we add on both sides dim(C7 H W) and use dim W /(U C\ W) + 
dim(U H W) = dim W we get 

dim(C7 + F) + dim(C7 H W) = dim U + dim TF. 
Next we use (1.1) for CA = U Ci W, U 2 = W, U 3 = V: 

dim V/(U H WO = dim TF/(C7 n WO + dim F/TF 
= dim (17 + WO/17 + dim 7/TF. 
If we add dim V/(U + W) and use 

dim V/(U + W) + dim (U + W)/U = dim V/U 
we obtain 
dim V/(U +W) + dim V/(U H WO = dim F/J7 + dim F/TP. 

If the dimension of V is finite all subspaces of V have finite dimen- 
sion. If, however, dim V = , then our interest will be concentrated 
on two types of subspaces U. Those whose dimension is finite and, 
on the other hand, those which are extremely large, namely those 
which have a finite dimensional supplement. For spaces of the second 
type dim U but dim V/U is finite; dim U tells us very little 
about U, but dim V/U gives us the amount by which U differs from 
the whole space V. We give, therefore, to dim V/U a formal status by 

DEFINITION 1.3. The dimension of the space V/U is called the 
codimension of U: 

codim U = dim V/U. 
The various results we have obtained are expressed in 

THEOREM 1.4. The following rules hold between dimensions and 
codimensions of subspaces: 

(1.2) dim U + codim U == dim 7, 

(1.3) dim(f7 + WO + dim(f7 H W) = dim U + dim W, 

(1.4) codim(C7 + WO + codim([7 H W) = codim U + codim W. 

These rules are of little value unless the terms on one side are 
finite (then those on the other side are also) since an < could not 
be transposed to the other side by subtraction. 
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Spaces of dimension one are called lines, of dimension two planes 
and spaces of codimension one are called hyperplanes. 

3. More detailed structure of homomorphisms 

Let V and V be right vector spaces over a field k and denote by 
Hom(7, V) the set of all homomorphisms of V into 7'. We shall 
make Horn (7, V) into an abelian additive group by defining an 
addition: 

If / and g are e Hom(7, V), let / + g be the map which sends 
the vector X e V onto the vector f(X) + g(X) of 7'; in other words, 

(/ + g)(X) = f(X) + g(X). 

That / + g is a homomorphism and that the addition is associative 
and commutative is easily checked. The map which sends every 
vector X e V onto the vector of V is obviously the element of 
Hom(7, 7') and shall also be denoted by 0. If / e Hom(7, 7'), 
then the map / which sends X onto (f(X)) is a homomorphism 
and indeed / + (/)= 0. The group property is established. 

In special situations it is possible to give more structure to 
Hom(7, 70 and we are going to investigate some of the possibilities. 

a) V = 7. 

An element of Hom(7, 7) maps 7 into 7; one also calls it an 
endomorphism of 7. If /, g E Hom(7, 7), then it is possible to com- 
bine them to a map gf : V -A 7 -* 7 as we did in 1 : gf(X) = g(f(X)). 
One sees immediately that gf is also a homomorphism of 7 > 7. 

Since 



= 01/00 

and 

0CA + /0(*) = rffiGK) 

= (0/1 + 

we see that both distributive laws hold; Hom(7, 7) now becomes 
a ring. This ring has a unit element, namely the identity map. 

The maps / which are a one-to-one correspondence lead to an 
inverse map /" l which is also in Horn (7, 7). These maps / form 
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therefore a group under multiplication. All of Chapter IV is devoted 
to the study of this group if dim V is finite. 

Let us now investigate some elementary properties of Hom(F, V) 
if dim V = n is finite. Let / e Hom(F, F) and let U be the kernel 
of /. Then V/U c~ /(F) so that the dimension of the image /(F) 
is n dim U. This shows that / is an onto map if and only if dim 
[7 = 0, i.e., if and only if / is an isomorphism into. 

Let A l , A 2 , , A n be a basis of F and set f(A<) = B t . If 
X = A&i + A 2 x 2 + + A n x n e F then 

(1.5) /(X) = B lXl + B 2 x> + + B n x n . 

Conversely choose any n vectors B { e F and define a map / by (1.5). 
One sees easily that / e Hom(F, F) and that f(Ai) = B< . Conse- 
quently / is completely determined by the images B { of the basis 
elements A i and the J? t can be any system of n vectors of F. If we 
express each B t by the basis A, , 

n 

f(A,) = B, = 53 A,a, i9 j = 1, 2, , n, 

F-l 

then we see that / is described by an n-by-n matrix (a,,-) where i is 
the index of the rows and j the index of the columns. 

Let g t Hom(F, F) be given by the matrix (6,-,) which means that 



Then 

(/ + g)(A t ) = E A.(a., + 

x-l 

and 



4 z 

\ F -l 
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We see that / + g is described by the matrix (a tl - + &<,) and fg 
by (2"-i a A/)- This is the reason for defining addition and multi- 
plication of matrices by 



Under this definition of addition and multiplication the corre- 
spondence / * (a,-/) becomes an isomorphism between Hom(F, V) 
and the ring of all n-by-n matrices. 

This isomorphism is far from canonical since it depends on the 
choice of the basis A t - for V. 

Let g be another element of Hom(F, F), but suppose that g is 
one-to-one. Let (&,,-) be the matrix associated with the element 
gfg~ l of Hom(F, F). The meaning of the matrix (&,-,-) is that 

0/flf '(A,) = E A 9 b . 

-l 

If we apply g' 1 to this equation it becomes 



Since 0f l is any one-to-one onto map of F the vectors g~ l (A w ) are 
another basis of F, and g can be chosen in such a way that gT^A,) 
is any given basis of F. Looking at the equation from this point of 
view we see that the matrix (&,-/) is the one which would describe / 
if we had chosen g~ l (A 9 ) as basis of F. Therefore: 

The matrix describing / in terms of the new basis is the same as 
the one describing gfg' 1 in terms of the old basis A, . In this state- 
ment g was the map which carries the "new basis" g~ l (A 9 ) into the 
old one, 

g(g- l (A 9 )) = A, . 

This g is, therefore, a fixed map once the new basis is given. Suppose 
now that / > A, g > D are the descriptions of / and g in terms of 
the original basis. Then gfg' 1 > DAD" 1 . The attitude should be 
that g is fixed, determined by the old and the new basis, and that / 
ranges over Hom(F, F). We can state 
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THEOREM 1.5. The ring Hom(F, F) is isomorphic to the ring of 
all n-by-n matrices with elements in k. The isomorphism depends on 
the choice of a basis. Let g be an element of Hom(F, F) which carries a 
selected new basis into the old one and suppose that g D describes g 
in terms of the old basis. If f > A is the description of any f in terms 
of the old basis, then DAD' 1 is the description of this same f in terms 
of the new basis. 

Mathematical education is still suffering from the enthusiams 
which the discovery of this isomorphism has aroused. The result 
has been that geometry was eliminated and replaced by computa- 
tions. Instead of the intuitive maps of a space preserving addition 
and multiplication by scalars (these maps have an immediate geo- 
metric meaning), matrices have been introduced. From the innumer- 
able absurdities from a pedagogical point of view let me point 
out one example and contrast it with the direct description. 

Matrix method: A product of a matrix A and a vector X (which is 
then an n-tuple of numbers) is defined; it is also a vector. Now the 
poor student has to swallow the following definition: 

A vector X is called an eigen vector if a number X exists such that 

AX = \X. 

Going through the formalism, the characteristic equation, one 
then ends up with theorems like: If a matrix A has n distinct eigen 
values, then a matrix D can be found such that DAD~ l is a diagonal 
matrix. 

The student will of course learn all this since he will fail the course 
if he does not. 

Instead one should argue like this: Given a linear transformation / 
of the space F into itself. Does there exist a line which is kept fixed 
by /? In order to include the eigen value one should then modify 
the question by asking whether a line is mapped into itself. This 
means of course for a vector spanning the line that 

f(X) = \X. 

Having thus motivated the problem, the matrix A describing / 
will enter only for a moment for the actual computation of X. It 
should disappear again. Then one proves all the customary theorems 
without ever talking of matrices and asks the question: Suppose 
we can find a basis of F which consists of eigen vectors; what does 
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this' imply for the geometric description of /? Well, the space is 
stretched in the various directions of the basis by factors which are 
the eigen values. Only then does one ask what this means for the 
description of / by a matrix in terms of this basis. We have obviously 
the diagonal form. 

I should of course soften my reproach since books have appeared 
lately which stress this point of view so that improvements are to 
be expected. 

It is my experience that proofs involving matrices can be shortened 
by 50% if one throws the matrices out. Sometimes it can not be 
done; a determinant may have to be computed. 

Talking of determinants we assume that the reader is familiar 
with them. In Chapter IV we give a definition which works even in 
the non-commutative case; let us right now stick to the commutative 
case. If k is a commutative field and / c Hom(F, F), then we define 
the determinant of / as follows: Let A be a matrix describing / and 
put det / = det A. 

If we use a new basis, A has to be replaced by DAD~ l , the deter- 
minant becomes det D- det A- (det D)" 1 by the multiplication 
theorem of determinants; since k is commutative det D cancels and 
we see that the map 

/ -> det / = det A 

is well defined and canonical. If g corresponds to the matrix B, then 
fg corresponds to the matrix AB and the multiplication theorem 
shows det fg = det /-det g. 

THEOREM 1.6. There exists a well defined map 

Hom(F, F) > k (if k is commutative) 
called the determinant of an endomorphism /. // satisfies 
det(/0) = det /-det g. 

In view of this fact it should be possible to describe det / in an 
intrinsic manner. The reader will find such a description in Bourbaki, 
Algfcbre, Chapter III. 

6) V is a two-sided space. 

DEFINITION 1.4. Suppose that V is both a right and a left 
vector space over k and that we have the additional rule (aA)b = 
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a(Ab) for all a,btk and A t V. Then we call V a two-sided space 
over k. 

THEOREM 1.7. // V is a right and V a two-sided vector space 
over 7c, then Hom(F, V) can be made into a left vector space over k in a 
canonical way. 

To this effect we have to define a product af for a e k and 
/ e Hom(F, V). We mean by af the function which sends the vector 
X e V onto the vector a-f(X) in V. Since 

(af)(X + 7) = a-f(X + 7) = a 



and 

(a/)(X6) = a-f(Xb) = a 

a/Hom(F, 7'). 
The equations 

(a(/ + 0))(X) = a((f + g)(X)) = a(f(X) 

= (af)(X) + (ag)(X) = (af + ajr)(X), 
((a + 6)/)(X) = (a + 6)-/(Z) = a/(X) + 6/(X) = (a/ + 



show that IIom(F, F') is a left space over k. 

The question arises: Can one make any right space V in a natural 
way into a two-sided space by defining a product aX from the left? 
One thinks of course of the definition aX = Xa. But then a(bX) = 
(bX)a = (Xb)a = X(ba) = (ba)X, whereas we should have obtained 
(ab)X. This "natural" definition works only if the field k is com- 
mutative. 

THEOREM 1.7'. // V and V are vector spaces over a commutative 
field k, then Hom(F, V) can be made in a natural way into a vector 
space over k. 

If k is again any field, then the most important example of a 
two-sided vector space V is the field k itself if one defines addition 
and multiplication as they are defined in the field. We shall investigate 
this case in the next paragraph. 
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4. Duality and pairings 

DEFINITION 1.5. If V is a right vector space over fc, then the 
set V = Hom(F, k) is a left vector space over k called the dual of V. 
The elements ^, ^, of t^ are called functionals of V. To repeat 
the definition of a functional <p: it is a map V A fc such that 



J5) = <p(A) + <p(B), 2) 
The operations between functionals are 
3) fe + *)(A) = *(A) + *(A), 4) 



If F is a left vector space over /c we also define the dual space 
t^ = Hom(F, fc). In order to obtain complete symmetry we write, 
however, (AV instead of <f>(A). V is a right vector space over k. 

The notation will become still simpler if we write <pA instead of 
<p(A) (and A<? instead of (A)<p if V is a left space). Rewriting 1) and 3) 
these rules take on the form of "distributive laws" and 2) and 4) 
become "associative laws". In other words let us regard <pA as a 
kind of "product" of an element <p e V* and an element A e V such 
that <pA E k. 

This change of notation suggests a generalisation where a right 
space V and a left space W are given (W not necessarily the dual 
of V), together with a product AB e k for A e W, B E V. 

DEFINITION 1.6. If W is a left and V a right vector space over fc, 
we say that a pairing of W and V into F is given, provided a product 
AB e fc is defined for all A E TF and all B e F such that the following 
rules hold: 

1) A(B l + B 2 ) = AB, + AB 2 , 2) A(Bb) = 
3) (A t + A 2 )B = A!JS + A 2 B, 4) (aA)J5 = 

We notice that V" and F are naturally paired into fc. 

Our task is to study V" more closely and to investigate general 
pairings. 

Let {Ai} be a basis of the right space F where the indices i range 
over some set / which we have used for indexing. Let <p e ^, and put 
<pAi = a,- e fc. An element X e F can be written uniquely in the form 
X = ]T)f A t -z t - where x< 9* holds only for & finite number of indices, 
since X must be & finite linear combination of {A,-}. Then 
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<f>X = 2 <pA t Xi = X) fl.* 


This shows that <p is known if all the a< are known. Select conversely 
an a t - e k for each index i, not restricted by any condition and define 
a function <p(X) by 



The sum on the right side makes sense since x t j holds only for a 
finite number of the subscripts i. The two equations <p(X + Y) = 
<t>(X) + >(F) and <p(Xb) = <p(X)b are immediately checked. Since 
A i = X" A v $ vi 9 where, as usual, d it = 1 and d fi = for j 7* z, we 
get also <p(A{) = 2* M,t = a. Thus we have 

THEOREM 1.8. // {A { } is a basis of V, then for randomly given 
a c k (one for each i) there is one and only one <p e ^ such that <pAi = a t - . 

Let i be one of the subscripts. Denote by <p { the functional for 
which <p t Ai = 1 and > t A,- = for i 7* j. In other words <p v A f = d if . 

A finite linear combination of the <p f has the form <p = J^ t - & t ^< 
where 6 ^ holds only for a finite number of i. The <p which we 
obtain in this way form the subspace W Q of V^ which is spanned by 
the <pi . For such a <p we get 



This means that <p alone already determines the 6,- as values of (p 
on the basis vectors. The linear combination 2 &> i s > therefore, 
unique, the <p t are independent and consequently a basis of W Q . 
Since there are as many ^> t as there are A< , we get dim W = dim V. 
WQ can be described as the set of those functional <p of V for which 
<pAi 7* holds only for a finite number of i. Should dim V be finite, 
then W Q contains obviously all functional of V, i.e., WQ = V' and 
we get dim V = dim t^. If dim V = , then dim W = < and, 
since W Q C ? and our notion of dimension is very crude, we also 
have dim V' = < . WQ is then certainly not the whole space V. If 
A = 2* ^ a ^ au d -4. 5^ 0, then at least one a,- 5^ 0. For this j 
we get <pi A ^ 0. By the definition of a functional we know trivially 
that only the zero functional vanishes on all of V. Now we see an 
analogue: If A e V, and if <pA =0 for all <p t V, then 4=0. 
Let us now state our results. 
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THEOREM 1.9. We always have dim V = dim V. If we know 
<pA = 0/or all A, then <p = 0; if we know <?A = 0/or all <p, then A = 0. 
Let dim V = n be finite. To a given basis \A t \ of V we can find a 
"dual basis" {<p % } of V where <p.Ai = d ti . 

Turning our attention to pairings we suppose that a pairing of the 
left space W and the right space V into k is given. 

DEFINITION 1.7. If A t W, B e V and AB = we shall say that 
A is orthogonal to B. If W is a subspace of W and V Q a subspace of V, 
we say that W is orthogonal to V Q provided AB = f or A t W Q 
and B t V . If F is a subspace of V, one sees easily that the set V% 
of all vectors in W which are orthogonal to V is a subspace of 
W : V H C W. Similarly each given subspace W of W gives rise to a 
subspace W% of V. We have trivially V C (V*)*- Abbreviating we 
write F?* instead of (F?)*. Of special importance is the subspace 
F* of W consisting of all vectors of W which are orthogonal to all 
vectors of F. We shall call F* the left kernel of our pairing. Similarly 
we call the subspace W* of F the right kernel of our pairing. 

Notice that Theorem 1.9 tells us that in the pairing of F and F 
both kernels are 0. 

Suppose that in our pairing of W and F into fc the left kernel is 0. 
Each A t W gives us a function <p A on F with values in fc, namely 

^(JQ = AX. 

The function <p A is easily seen to be a homomorphism F fc, 
i.e., <p A t V. 

This suggests studying the map W > V which maps the vector 
A onto <p A . We have 

* M X = (A+ E)X = AX + BX = (^ 
and 

<p oA X = (aA)X = a(AX) = a<f> A X. 



The two equations ^+/? = <PA + <PB and ^> aA = &PA mean that our 
map W > ? is a homomorphism. The kernel of this map consists 
of those A c TF f or which <p A is the zero function: AX = for all 
X t V. However we had assumed that the left kernel of our pairing 
is zero. Hence A = 0. Our map W > V is, therefore, an isomorphism 
into (not always onto). 
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Similarly, if the right kernel is 0, one obtains an isomorphism into: 
V -+TF. 

Suppose again that the left kernel is 0. Let W be a subspace of 
W and W$ the subspace of V which is orthogonal to W Q . We can 
find in a natural way a new pairing of the space W Q and the space 
V/W* into k by defining as product of a vector X e W and of a 
coset Y + W* in F/TF? the element XY of k: 

X-(Y + TT?) = JTF. 

This pairing is well defined. Indeed Y + W% = Y l + W* Q implies 
Y - Y l e W*<> and since X is in W we have X(Y - FO = or 
XY = ^F! . That this new multiplication satisfies the axioms of a 
pairing is obvious. What is the right kernel? Y + W* will be in the 
right kernel if XY = for all X e W . This means Y e W* and, 
therefore, Y + W*<> = W* Q , the zero element of the space V/W* . 
The right kernel of our new pairing is 0. We can use the previously 
established method to construct an isomorphism into (canonical) : 



V/W 






What is this map? Given an element A + W* Q of V/W* Q ; the func- 
tional of W Q associated with it is 

X- (A + W*,) = XA (X ranging over TF ). 

If F is a given subspace of V, we can also define a natural pairing 
of VI and V/V Q by setting 

X-(Y+ Fo) = XY, XtV* Q , Y+ F e F/F . 

As before this pairing is well defined and satisfies the axioms. 
This time we ask for the left kernel; X will lie in the left kernel if 
XY = for all Y t V. But since we have assumed that the left kernel 
of our original pairing is 0, this means that the left kernel of our new 
pairing is 0. We obtain, therefore, an isomorphism into: 

VI - F/Fo . 

The inexperienced reader is urged not to give up but to go over 
all definitions and mappings again and again until he sees everything 
in full clarity. We formulate our results: 

THEOREM 1.10. Let W and V be paired into k and assume that 
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the left kernel F* of our pairing is 0. Let W Q be a subspace of W and 
F a subspace of F. There exist natural isomorphisms into: 

(1.6) 



(1.7) F->F/F O . 

Since these maps are isomorphisms into, the dimensions of the 
spaces on the left are < to the dimensions of the spaces on the right. 
Under the symbol dim the ~ on top of a space can be dropped because 
of Theorem 1.9. We obtain dim V/W* Q < dim TF and dim V* Q < 
dim F/Fo , inequalities which we can also write in the form 

codim TF$ < dim W and dim F? < codim F . 

If We put F = W* Q in the second inequality and combine it with 
the first, we get 

dim Fo** < codim W ? < dim W . 

And now we remember the trivial fact that W Q C W* * so that 
dim W < dim W** . Therefore we get equality 

(1.8) dim TF ** = codim W* Q = dim W . 

The main significance of this formula appears in the case when 
dim W is finite. Since W C W$* , we see that simply TF* = W . 
In the isomorphism (1 .6) both spaces have the same finite dimension, 
our map is, therefore, onto and thus V/W* Q may be regarded naturally 
as the dual of W Q . The map (1.7) should be used for F = W% and 

becomes an isomorphism of W into V/W* Q . This map is onto again 
and we see now that each of the spaces W and V/W% is naturally 
the dual of the other. 

In (1.8) no restriction on W Q is necessary; we can use the formula 
on Wo = W. We obtain 

(1.9) codim W* = dim W. 

We should mention that our results are true for all subspaces of 
W if dim W itself is finite. 

What can we do if the left kernel F* is not 0? We make a new 
pairing (by now you are used to this procedure) between W/V* 
and F by defining 

(X + F*)F = XY, X + F* e TF/F*, 7c F. 
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The element X + V* lies in the left kernel if XY = for all Y e V. 
This means X e F*, and hence X + F* = F*, the zero element of 
F/F*. The left kernel is now zero. The right kernel is obviously 
the old W*. Equation (1.9) tells us that the codimension of W* in 
F is the same as the dimension of our left factor which is W/V*. 
We obtain, therefore, 

(1.10) dim V/W* = dim W/V*. 

Suppose now that both kernels F* and W* are zero and that dim W 
is finite. (1.10) shows that dim F is also finite and equals dim W. 
In this case we can use all our results unrestrictedly on subspaces 
of both W and F. If we choose for instance W Q = W, then W*Q = 
and F/WO = F; we see that each of the spaces W and F is naturally 
the dual of the other. The reader should again visualize the map: 
A in W corresponds to the functional AX = <p A X and A > <p A is 
one-to-one onto. Let us still have a look at the correspondence 
W - W* Q of a subspace W Q C W and the subspace W\ C F. Any 
subspace F C F is obtainable from a TF ; we have merely to put 
W Q = V*Q . And distinct subspaces of W give distinct images since 
W*Q = W\ implies (by starring) W = W l . It is therefore a one-to-one 
correspondence which is lattice inverting, i.e., an inclusion W Q C Wi 
implies W* D W* (a strict inclusion becomes strict again). Again let 
us collect our results. 

THEOREM 1.11. Assume W and V paired into k: 

a) dim W/V* = dim V/W*', in particular, if one of the spaces 
W/V* and V/W* is finite dimensional, the other one is also, and the 
dimensions are equal. 

b) // the left kernel is and W C W, then 

(1.11) dim W Q = codim W* Q = dim W*.*. 

If dim WQ is finite, then W** = W Q and each of the spaces W Q and 
V/W% is naturally the dual of the other. 

c) // both kernels are and dim W is finite, then each of the spaces 
W and V is naturally the dual of the other. The correspondence W <-> W% 
is one-to-one between the subspaces of W and the subspaces of V. It 
reverses any inclusion relation (in the strict sense). 

In case our pairing is the one between tf and F we can strengthen 
the results. We know already that both kernels are 0. Let F be any 
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subspace of V (not necessarily of finite dimension) and consider the 

map (1.7) of Theorem 1.10 : F* > V/V Q . It is an isomorphism iito 
and we shall show that it is onto. Let <p be any functional of F/V - 
We construct a function & on the space V by defining 



Then 
MX + F) = <p(X + Y + Fo) = X(X + Fo) + (Y + 

XX + Fo) + v(Y + Fo) = XX) + 
and 

F )a) = <p(X + F )a 



This implies that is a functional of F. Suppose X e F , then 
XX) = X* + Fo) = X^o) = Xaero element of F/F ) = 0. 
<p vanishes on all of F and, therefore, belongs to F? . We contend 
that has the given y as image under the isomorphism (1.7) and 
this will show the ontoness of our map. What is its image? We had 
to make the new pairing of FH and F/F by defining Y (X + F ) = 
YX (Y t V* Q , X e F); for Y = $ t V% this means (X + F ) = &X 
and this function & (X + F ) on F/ F is the image. But &X was 
by definition y(X + F ) so that this function on F/F is indeed ^?. 
The space F? is now naturally the dual of F/F . Let A $ F . 
The coset A + V is not the zero element of F/F so that a functional 

<p e F/F can be found such that <p(A + F ) 7* (Theorem 1.9). 
The corresponding & t V^ gives then &(A) = <p(A + F ) 5^ 0. This 
vector A is, therefore, not orthogonal to all of F? and we conclude 
that a vector orthogonal to all of F * must, by necessity, lie in F . 
Since F is trivially orthogonal to F? we see that we have also 
proved F* * = F . Finally, we set in formula (1.11) of Theorem 1.11: 
dim TF = codim W% , for W Q the space F * . Since Wl = F , we 
obtain dim F * = codim F which supplements the analogue of (1. 11), 
dim F = codim F * , which is true in all pairings. Thus we see in our 
special case that there is a one-to-one correspondence W Q <-> Wf> 
between subspaces W Q C W of finite dimension and subspaces of 
F with finite codimension. Indeed, if codim F is finite, then dim F? 
= codim Fo is finite and W = V* is the space which gives W* = F ; 
there is only one such space. If W^ is given and of finite dimension, 
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then (1.11) shows codim W*<> = dim W Q and W = VI gives V 

w* 

W o . 

THEOREM 1.12. Consider the pairing between fi = W and V. 
If V is any subspace of V, then F?* = V and V* Q is naturally the 
dual of F/Fo . We have not only dim V = codim VH (established in 
Theorem 1.11), but also dim F* = codim V . The correspondence 
W Q <-> W * is one-to-one between all subspaces W C W of finite dimen- 
sion and all subspaces of V with finite codimension. Similar results 
would not hold in general for subspaces of V. 

Let us look especially at a hyperplane F of V. Then codim V = 1, 
hence dim V% = 1. Let VI = <^>) (<p 7* 0). Since F?* = F we see 
that the vectors X of F can be characterized as the solutions of the 
equation <pX = 0. Any \[/ e tf such that \I/X = for all X e F must 
lie in F * and is therefore a left multiple of <p. If we start with any 
V 76 arid put W = (#} then dim W = 1 and hence codim IF? = 1. 
The solutions of <f>X = form a hyperplane. 

The proof of these simple facts about hyperplanes is burdened 
by too much theory. Let us see whether we can not get them from 
scratch: 

Take a functional <p 7* of F. Map F > k by sending X > <pX. 
This is a homomorphism by definition of a functional. Since ^ 5^ 
there is some non-zero image b and since k (as right k space) is 
1 -dimensional, the map is onto. Let F be the kernel. Then F/F ^ k, 
whence dim F/F = codim F = 1. Start conversely with a hyper- 
plane F . Consider the canonical map F > F/F with kernel F . 
F/Fo is 1-dimensional, hence isomorphic (not canonical) to k. The 
map F F/Fo > k is then a functional v? with kernel F . Take 
any vector A 4 F and two functionals ^>, ^ with kernel F . Then 
<p(A) = a 5^ 0, ^(A) = 6 7* and the functional <p ab~ l \l/ will 
map F and A onto 0. It vanishes on F and is therefore : <p = ab~ V- 

THEOUEM 1.13. A hyperplane F o/ F can be described as the set 
of solutions <pX = where <p 7* is an element of V and conversely 
any y j of V r gives in this way a hyperplane. <p is determined by F 
up to a left factor 9* of k. 

5. Linear equations 

We illustrate the theory developed thus far by applying it to the 
theory of linear equations. The beginner should not expect that we 
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will be able to develop miracle methods for solving equations. An 
actual solution is still best found by the elementary method of suc- 
cessive elimination. 
Let 

+ a ln x n = b l , 



be a system of ra equations with n unknown o\ whose coefficients 
a,, and right sides 6 t are given elements of a field k. 
The following objects will play a role in the discussion: 

1) The matrix (a,,) of the coefficients with m rows and n columns. 

2) A right n-dimensional vector space V over k and a fixed basis 
E l , E 2 , - , E n of V. A solution (x l , z 2 , , z n ) of (1. 12) shall be 
interpreted in V by the "solution vector" X = E&i + E 2 x 2 + + 
E n x n . 

3) The dual space V' with the basis ^ , <f> 2 , , y? n dual to 
, , E 2 , , E n . With the f-th row of (a t; ) we associate the functional 

- , m) . 



If X = #!#! + JBJ 2 o: 2 + + A T n x n is any vector of V then 



(this follows easily from ^ t /?,- = 5 tJ ). 

4) The subspace W of ^ spanned by ^ , ^ 2 , , ^m - Its dimen- 
sion r is the maximal number of linearly independent vectors among 
the \l/i and consequently the maximal number of left linearly inde- 
pendent row vectors of (a,/). We call r therefore the left row rank 
of (a t/ ). 

5) The m-dimensional right space S m of m-tuples of elements of k. 
In it we place the n column vectors A l , A 2 , , A n of the matrix 
(a,-,) and also the vector B = (bi , b z , , 6 m ). The vectors 
A! , A 2 , - , A n will span a subspace C7 of S OT and its dimension 
shall be called the right column rank of the matrix (,). 

In our setup the equations (1.12) can be rewritten in two ways. 
In the space S m they are obviously equivalent with the vector equa- 
tion 
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(1.13) A,x, + A 2 x 2 + + A n x n = B. 
But they can also be written as \f/ t X = b, , or as 

(1.14) B = (fcZ, * 2 X, , iMT), X e 7. 

Our equations need not have a solution. The problem of solva- 
bility is stated in the following way: Consider the matrix (a</) as 
given and fixed. For which vectors B does a solution exist? 

Equation (1.13) tells us that the vectors B for which a solution 
exists must lie in the subspace U of S m . 

Equation (1.14) tells us that B must lie in the image space of the 
following map of V -A S m : 

f(X) = (fcX, * 2 X, , fJQ, XtV. 

This map / is obviously a homomorphism. Its kernel consists of 
those vectors X for which ^,X = (i = 1, , m), hence of the 
vectors X which are orthogonal to \l/i , \f/ 2 , , $ m and, therefore, 
orthogonal to the space W C ^ which they span. The kernel is 
therefore W*. We see that the image space U ^ V/W*, hence 
dim U = codim W*. But codim W* = dim W^, and we obtain 
dim U = dim W. We have therefore the rule: 

left row rank of (a t; ) = right column rank of (a,/), 

a rule which facilitates sometimes the computation of ranks. 

In most applications the answer we have found thus far is satis- 
factory: The equations have a solution if and only if B belongs to 
a certain r-dimensional subspace U of S m , the point being that we 
know now that r is the left row rank of (a,,). 

Frequently one asks when the equations have a solution for all 
B t S m . This means U = S m or r = m, and is true if and only if the 
rows of (a M ) are left linearly independent. 

Suppose now that B is of such a nature that the equations have a 
solution and let X Q be a special solution: ^vY = 6 t (i 1, 2, - , m). 
Then the equations can be rewritten as 

*.(X - Z ) = 

and mean that X X must belong to the subspace W* of V. The 
solutions consist of the coset X + M 7 *. We have, therefore, the 
familiar rule: general solution = special solution + general solution 
of the homogeneous equations. 
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When is the solution unique? This means W* = and conse- 
quently W = t^; therefore, r = n. Let us review the extreme cases 
for r: 

r = m means solvability for all B; 
r = n means uniqueness of the solution, if it exists. 

Should m = n (the frequent case of as many equations as there 
are unknowns), then r = n means solvability for all B as well as 
uniqueness of the solution, and is also equivalent with W* = 
which implies that the homogeneous equations have only the trivial 
solution. This fact is used frequently in applications. 

The description X Q + TF* which we have given for the solutions 
calls for a geometric language. 

Let V Q be any subspace of V. A coset A + F shall be called a 
linear variety. If the coset A + F is merely given as a set of vectors, 
then one can get back the subspace F by subtracting from each 
vector of A + F a special one, say A. The subspace F shall be 
called the direction of the linear variety. (The intuitive picture is, 
of course, to think of vectors as "arrows" from the origin and to 
think of the linear variety A + F as consisting of the endpoints of 
every "arrow" in A + F .) We can now say that the solutions of 
our equations form a linear variety which passes through a special 
solution and has the direction W*. If A + F is a linear variety, 
then dim F shall be called the dimension of the linear variety. For 
the solutions of our equations the dimension is dim W* = codim W 
and, therefore, n r. 

One may also ask for the converse. Given a linear variety A + F . 
What can be said about all equations \I/X = b ($ e V") which are 
satisfied by all X t A + F ? They must be satisfied by A, which 
gives b = $(A), and thus they have the form $(X A) = or 
^(l^o) = 0; $ must, therefore, belong to F? . Any ^ e Ft gives really 
an equation: set b = \f/A\ since \f/(X A) = 0, if X e A + F , we 
get \[/(X) = b. If we put r = dim V% and let ^ , ^ 2 , , \[/ r be a 
basis of F? , then \1/ { X = ^ t A (i = 1, , r) is a set of linear equations 
and their solution satisfies $i(X A) = (i = 1, - , r) so that 
X A z F?* = Fo ; A + Fo is the set of solutions, and dim F = 
codim Ft = n r. 

The elementary geometry of linear varieties is very simple. Let 
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LI = A + Vi and L 2 = B + F 2 be two linear varieties with direc- 
tions Vi and F 2 . They need not intersect. When is L t H L 2 not 
empty? If and only if a vector X l t V l and a vector X 2 t F 2 exist 
such that A+X l = B + X 2 orA-B = (--YO + X 2 . This means 
A J3 e V l + F 2 . If L! r\ Z/2 is not empty and C a common vector, 
then we can write LI = C + V l , L 2 = C + F 2 , hence LX r\ Z/ 2 =* 
C + (F! n 7 a ). The direction of L t n L 2 is, therefore, V l H F 2 . 
Its dimension (still for non-empty L! C\ L 2 ) is that of FI O F 2 . 
We must also consider the "join" L t o L 2 of Lj and L 2 , the smallest 
linear variety containing both L t and L 2 . Its direction must contain 
A B and all differences of V l and F 2 , hence all vectors of 
(A B) + Vi + F 2 . The join contains B, hence certainly B + 
(A B) + FI + F 2 . But this is a linear variety and contains L l 
and L 2 , hence 

L! o L 2 = 5 + (A - 5) + F, + F a . 

Its direction is (A B) + F! + F 2 . We notice that there is a case 
distinction: 

1) Lj P\ L 2 is not empty, then A B t V l + F 2 and 

L! o L 2 = B + V, + V 2 , dim(L, o L 2 ) = dim(F, + F 2 ). 
We obtain 

dim(Li n L 2 ) + dim(Z/! o L 2 ) = dim L, + dim L 2 . 

2) L! Pi L a is empty. Then A - B 4 V 1 + V 2 and 

o L a ) = 1 + dim(F r + F 2 ). 



Let us illustrate this in a few special cases. If dim L = dim L 2 = 0, 
then L! = A, L 2 = B, F! = F 2 = 0; if L v ? L 2 , then they do not 
meet and 

dim(L! o L 2 ) = 1 ; 

Lj o L 2 is the unique "line" through A and B. 

Suppose n = dim F = 2, dim L^ = dim L 2 = 1. If L l C\ L 2 is 
empty we have dim(Li o L 2 ) = 1 + dim(F! + F 2 ) < 2, hence 
dim(F! + F 2 ) = 1, V l = F 2 (parallel lines). 

If Lj C\ L 2 is not empty, then 

dim(L 1 n L a ) + dim^ o L a ) = 2. 
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If L! 5* L 2 , then dim(Lj o L 2 ) > 1, hence din^Lj C\ L 2 ) = 0, 
L! C\L 2 a "point". 

6. Suggestions for an exercise 

The reader will come to a clearer understanding of the content of 
2-4 if he works out by himself the analogue for ordinary additively 
written abelian groups: 

Let R be the additive group of real numbers and Z the subgroup 
of the ordinary integers. We must first familiarize ourselves with the 
factor group R/Z. The elements of R/Z are the cosets a + Z with 
a e R. Two cosets, a + Z and b + Z, are equal if and only if a b is 
an integer. To simplify notations it shall be understood that we 
describe a + Z by merely giving a (where a is only defined up to an 
integer). Such a coset may be multiplied by an integer (this can be 
done in any additive group) but a product of two cosets is not well 
defined. One of the main properties of R/Z is that it is "divisible": 
To an element a t R/Z and an integer n > one can find an element 
b such that nb = a. However, this & is not uniquely defined. Aside 
from the coset given by a/n one has also the cosets a/n + i/n for 
0<i<nlasa possible b. 

The reader should now go over our whole exposition on vector 
spaces, replacing everywhere the word vector space by abelian group, 
subspace by subgroup and dimension by order of the group. Whenever 
a plus sign occurs between dimensions it has to be replaced by a 
product sign. Any reference to the field k has to be dropped. We keep 
the notations for easier comparison: V now means additive group, 
dim V its order. We have no difficulty with the symbol (S) but 
disregard the notion of independence of a set. By a basis of a finite 
abelian group V we mean elements A l , A 2 , , A r of V whose 
orders are e 1 , e 2 , , e r e Z such that the A< generate V and such 
that m l A 1 + m 2 A 2 + - + m r A r = (w< E Z) if and only if each 
nti is a multiple of e t . The reader may consult any book on group 
theory for the proof that a finite abelian group has a basis. The 
notion of a basis does not work any longer for infinite groups and 
has to be abandoned. Consequently we can not prove Theorem 1.3 
and prove directly that for U 1 C U 2 C U 3 the analogue of equation 
(1.1), namely 

dim C/a/C/i = dim U./^dim U,/U 2 , 
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holds. We can now prove Theorem 1.4. In 3 we need only the fact 
that Hom(F, V) can be made into an abelian group; then we immedi- 
ately go over to 4. 

It is in the analogue of Definition 1.5 that the special group R/Z 
comes into play. We define F = Horn (7, R/Z) and talk in Definition 
1.6 of a pairing of abelian groups W and V into R/Z. 

If V is finite with the basis A { (e t the order of A 4 ), let <p t V. Then 
<pAi = a, c R/Z determines <p. In R/Z the a can, however, not be 
freely selected since e t A { = 0, and, consequently, e % a { (zero 
element of R/Z). This restricts a t to an element of the form m/e t 
with < m < g t 1. Within this restriction we have a free choice. 
This allows us to define a dual basis ^> t by letting <p l A i = l/e,--5 t ,- . 
It turns out that the <p t are a basis of V" with exactly the same 
orders e { . Therefore V' c^ V although the isomorphism is not canon- 
ical, since it depends on the choice of a basis for V. For a finite V 
one has, therefore, dim V = dim V'. 

To handle infinite groups the reader should try to prove the 
following lemma: Let U be a proper subgroup of V, X t V, X 4 U. 
Let <f> e U. Then <p can be extended to a functional on the group gen- 
erated by U and the element X consisting of all elements F + mX, 
YtU,mtZ. He should try the definition <p(Y + mX) = <p(Y) + ma 
with a suitable a t R/Z. How should a be selected so as to make the 
map well defined? In how many ways can one do it? Can one do it 
with an a ^ 0? Now one uses a transfinite argument to prove that <p 
can be extended to the whole group V. The reader will have clear 
sailing up to the end (i.e., Theorem 1.12). 

The notion of the dual of a space pervades all of modern mathe- 
matics. The reader will meet it again in the theory of Banach spaces 
and other topics of analysis. 

7. Notions oj group theory 

Let G be a group, not necessarily commutative, and S any subset 
ofG. 

DEFINITION 1.8. The set of all x t G for which the set xS is the 
same as the set Sx is called the normalizer N s of the set S. The set 
of all x c G for which xs = sx for all s t S is called the centralizer Z s 
of S. The set of all x e G which commute with all elements of G, 
in other words the centralizer Z of G is called the center of G. 
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N s and Z R are subgroups of G and the center is a commutative 
invariant (normal) subgroup of (?. 

Let / be a homomorphism of G into some other group. When is the 
image group f(G) commutative? /(a)/(b) = /(&)/() is equivalent 
with /(aba'V 1 ) = 1. If K is the kernel of / it moans aba'V 1 e K . 
The element abcT'b" 1 is called the commutator of a and b. Its inverse 
is bab~ l a~ l , the commutator of b and a. The kernel K contains all 
commutators. Conversely, let H be any subgroup of G which contains 
all commutators. If x t G and h e H, then xh = xhx~ l h~ l -h-x e Hx, 
hence xH C Hx and similarly Hx C xH. Such an H is automatically 
invariant and since the kernel H of the canonical map G > G/H 
contains all commutators, the factor group G/H will be abelian. 
The smallest subgroup (?' of G which we can form in this way con- 
sists of all products of commutators and (?' C K is another way of 
expressing the condition we found. 

DEFINITION 1.9. The set of all products of commutators of G 
is called the commutator subgroup G' of G. The factor group G/G' 
is abelian and the image of G under a homomorphism / will be an 
abelian group if and only if G' is contained in the kernel K of /. 
A subgroup H of G which contains 0' is necessarily invariant and 
G/H commutative. 

Let e be any element of G and consider the following map 
v e :G-+G: 

<p e (x) = exc" 1 . 



We have <p e (xy) = cxyc~ l cxc" 1 eye" 1 = <f> c (x)<p f (y) so that <p c is 
a homomorphism. 

<f> c (<f>d(x)) = c(dzd" V 1 = cdx(cd)~ l = <f> e d(x), 

in other words <f> c <p d = <p ed . The map ^ is the identity. If x t G is 
given, then x = ^(x) = ^(^-'(2)) which shows that <p e is onto. 
If <p e (x) = 1, then <f> c -i(<p e (x)) = 1 or x = 1. The kernel is 1, each ^ fl 
is an isomorphism of G onto (?, in other words an automorphism of G. 
This particular type of automorphism <p e is called an inner auto- 
morphism of G. Since ^ c ^ = <p e <t and v?i == 1, we see that the inner 
automorphisms of G form a group I G . 

Consider the map G -* I Q given by e > <p e . It is onto by definition 
and is a homomorphism. The element c will be in the kernel if <p e = 1, 
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cxc~* = x for all x e G. The kernel of this map is the center of (?. 
We see that 

I Q ~ G/Z . 

Let us call two subsets S and T of G equivalent if T is the image 
of S under some <? e : T = ? C (S). Then /S = *v-.(T); and if T and C7 
are equivalent, U = ^(T), then U = vWe(>S) = ^ C (S) so that 
[7 and S are equivalent. How many sets are equivalent to a given S? 
We have to decide: when is <p e (S) = <?*(&) , or S = <f> c -* d (S). But 
S = <p a (S) means S = aSaT 1 or Sa = aS or a e N s . Thus we have: 
<f> c (S) = ^d(S) is equivalent to c~ l d t N s , d t cN s . All elements d 
of the left coset cN s will give the same image <p d (S). The number of 
distinct images <p a (S) is, therefore, equal to the number of left cosets 
cN s of N s . The number (G : H) of left cosets cH which a subgroup 
H has in G is called the index of H in G. Thus we have proved: The 
number of distinct sets <p e (S) = cSc" 1 , which we will get when c 
ranges over (?, is equal to the index (G : N s ) of the normaliser of S. 

Of special importance is the case where S consists of one element 
only: S = {a}. Then each <? e (S) contains only the element cac" 1 . 
The elements of G are, therefore, partitioned into equivalence classes: 
a is equivalent to all cac~ l . The number of elements in the equivalence 
class of a is (G:N a ). Which of these equivalence classes contain 
one element only? For such an a we must have a = cac~ l for all 
c t G. They are, therefore, the elements a of the center Z G of G. 
Denoting by #(S) the number of elements in a set S we have, there- 
fore, #(#<?) equivalence classes which contain one element only. 
Counting the number of elements in G by equivalence classes leads 
to a formula, 

(1.15) #(G) = #(Z ) + E (G : #), 

a 

where ]T) means only rather vaguely "sum over certain a" but 
where each (G : N a ) > 1 since those cases where (G : N a ) = 1 have 
already been counted in #(Z ). 

Although the following application is not needed, we can not 
resist the temptation of mentioning it. 

Let G be a group whose order #((?) = p r (r > 1) is a power of a 
prime p. Then each term of 2 is a power of p since (G : N a ) divides 
p r . Since each term is > 1, the whole ]^ a is divisible by p. Therefore 
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#(Z ) = #(? J^ a is divisible by p. We have the famous theorem 
that the order of Z is > 1, Z is not just identity. 

If we have a multiplicative group G we may adjoin to it a zero 
element such that a = a = f or any a in G and also for a = 0, 
and obtain a new set which is not a group but shall be called rather 
inaccurately a group with 0-element. 

DEFINITION 1.10. By a * 'group with element" we shall mean a 
union of an ordinary group and an element such that a = a = 
for all a in the set. 

We have now to describe what we mean by an ordered group: 
It shall be a group G with an additional binary relation a < b satisfy- 
ing all laws one would like to have true. Denote by S the set of all 
a > 1. We would like to conclude from a > 1, 6 > 1 that ab > 1. 
S will have to be closed under multiplication. One also would like 
to conclude that a > 1 is equivalent with a~ l < 1 and certainly 
exclude a = 1. Furthermore, we would like to have each element 
either > I or = 1 or < 1. This shows that we should postulate 
that G be the disjoint union S' 1 W {1} W S where /S" 1 denotes the 
set of inverses of S. Finally a > 1 should imply ba > 6; multiplying 
on the right by b' 1 we would like to get bob' 1 > 1. Let us try this 
as a definition. 

DEFINITION 1.11. A group G is said to be ordered if a set S is 
singled out with the properties: 

1) G is the disjoint union S~ l U {1} U S, 

2) S-S C S (closed under multiplication), 

3) for any 6 t G we have bSb' 1 C S. 

Applying 3) for fe" 1 we get b~*Sb C Sor S C bSb' 1 and, therefore, 
bSb- 1 = S. 

We define a > b as meaning b l a E S. This gives b>(b" l a)b l e S 
or a&~ 1 e S. It is, therefore, irrelevant whether one says b~ 1 a e S 
or ab" 1 t S. Notice that a > 1 really means a e S as we would like 
it to be. 

Given any pair a, b. If b~ l a e S, then a > 6; if b~ 1 a = 1, then 
a = 5; if 6~ l a e S" 1 , then a~ l b e $, hence b > a and these possibilities 
exclude each other. Any two elements are " comparable' \ 

Suppose a > b and b > c, then ab~ l e S and be" 1 e S, hence 
aV l -bc~ l = ac~ l e S and we get a > c, the transitivity of >. 

Suppose a > b and let c be any element in (?; then b" 1 a c S and 
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ab~ l e S and consequently (cb)~ l (ca) = b~ l a e S and ac(bc)~ l = 
alT 1 e S which means that a > b implies ca > cb as well as ac > be. 
One can multiply an inequality by c. 

If a > b and c > d then ac > ad and ad > bd, hence by transitivity 
ac > bd. One can multiply two inequalities. 

If a > b then b~ l > a" 1 , since this means ab~ l e S. 

All intuitive laws for inequalities are satisfied. 

DEFINITION 1.12. A "group with 0" = U G is called ordered 
if the ordinary group G is ordered in the sense of Definition 1.11 and 
if is defined to be < than any element of G. 

Again all intuitive laws are true. 

8. Notions of field theory 

We have remarked repeatedly that a field k need not have com- 
mutative multiplication but the addition is of course commutative. 
The set of non-zero elements of k forms a multiplicative group which 
shall be denoted by k*. 

As in any additive group we can multiply an element a t k by an 
ordinary integer n t Z (Z shall denote the set of ordinary integers) 
and get an element na t k. In any additive commutative group 
one has the rules (n + in) a = na + ma, n(ma) = (nw)a, n(a + b) = 
na + nb. In k one sees easily the additional rule 

(nd)(mb) = (nra)(ab). 

For instance if n and ra are > the left side is 
( a + a + . . .)( b + b + . . .) 

and upon expansion of this product one gets the right side. 

Let a 7* 0. The question whether na = does not depend on a 
since this is equivalent with na-a~ l = or ne = where e is the 
unit element of k. This equation does certainly not hold f or n = 1. 
If we map the ring Z of integers into k by n > ne, then this map is a 
ring homomorphism. If ne 5^ whenever n 9^ 0, then it is an iso- 
mprphism; k contains in this case an isomorphic replica of Z and, since 
k is a field, an isomorphic replica of the field Q of rational numbers. A 
field fc of this type is called a field of characteristic 0. Suppose, on the 
other hand, that the map n>ne has a non-zero kernel H, a subgroup 
of the additive group Z. Such a subgroup 5* consists of all multiples 
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pv of the smallest positive integer p in H. As remarked earlier p j* 1. 
If p were not a prime number, then p = db with positive a, & which 
are < p. Since p-e = 0, we would get (ae)(be) = which is not 
true since k is a field. This number p is called the characteristic of a 
field of this type and is a prime number. 

The only distinct images of Z in fc are the p elements ve with 
< v < p 1. The p 1 non-zero elements among them are 
closed under multiplication and, since in a field the cancellation law 
holds, they form a group. The p elements ve form, therefore, a subfield 
Q 9 of fc (which is isomorphic to the field of residue classes Z/pZ of 
integers modulo p). From now on we will denote the unit element 
of fc by 1, the elements vl simply by v with the understanding that 
v is to be read modulo the characteristic of fc. 

If fc is a subfield of a field F, we may regard F as a left vector 
space over fc, taking as definition of the vector space operations the 
ones we already have in F. This space F has a dimension (over fc) 
which is called the left degree of F over fc and denoted by [F : fc]. 
One could of course also define a right degree; it is an unsolved problem 
to decide whether any connection exists between these two degrees. 
We shall stick consistently to the left degree. By a left fc-basis {a,} 
of F one means, therefore, a set of elements in F such that any 
element of F is a unique finite left linear combination of these basis 
elements with coefficients in fc. We may write ft = 2, x * a i where 
Xi E fc and where x { j& can hold only for a finite number of indices i. 

If F is a subfield of E and fc a subfield of F, let { T, } be a left 
F-basis of E and {a,-} a left fc-basis of F. Given an element A e E 
we can write A = ^/ 0/F/ w ^h Pt e F an d / 2 x u a 
x lf t k where only a finite number of the x,, are j& 0. We obtain 



i.i 

If, conversely, we have a sum 2./ x a a i^i an d call it 4, then 
A = 2,- (2< #t/.)r/ ; this A determines the coefficients 2< #<* 
of the F/ uniquely and each ]), #,,, determines the x it uniquely. 
This shows that the set {a t r/} is a fc-basis of E and we have proved 
the formula 

[E : fc] = [E : F][F : k]. 

If a t fc, then we call the set of all x t k for which xa = ax the 
normalizer N a of a. If x t N a and ysN a , then a: y e 2V. and XT/ e JV. 
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Should x 7* 0, then xa = ax implies oaT 1 = x~ l a, hence x~ l t N m . 
This proves that N a is a subfield of fc. Af* is the group theoretical 
normalizer of a in the group k*. The set of all x t k such that xy = yx 
for aH y t k forms also a field Z k and again we have that Z\ is the 
center of the group k*. We remark that trivially Z k C N m . 

A certain geometric problem is connected with fields k which 
contain only a finite number s of elements. The characteristic of 
such a field must be a prime p > 0. If k is a subfield of F 
and [F : fc] = r, then each element of F is obtained uniquely in the 
form 



where the a, form a fc-basis of F. Hence we see that F contains s r 
elements. 

Denote by Z k the center of k and let q be the number of elements 
in Z k . Call n = [k : Z k ] and let a be any element of k. Then Z A C 
N a Ck and we introduce the degrees: d a = [JV : Z k ] and e a = [k : N a ]. 
The relation n = d e a shows that d a is a divisor of n. 

We had called g the number of elements in Z k ; q n and g d * are, 
therefore, the number of elements in fc, N a , respectively, and the num- 
ber of elements in Z\ , N* a , fc* are q 1, q d * 1, q n 1, respectively. 
We apply now formula (1.15) of 7 to the group G = fc* and obtain 
a formula which looks like this: 

(1.16) 

where y^ d means vaguely a sum of terms, each one of the form 
(q n l)/(q d 1) and the same term possibly repeated. Indeed, 
(0:N*) = (q n - l)/(^ a - 1). The d should always be a divisor 
of n and be < n since each (G : N*) should be > 1. 

Our aim is to show that a formula like (1.16) can not exist if 
n > 1. If we succeed, we will have proved n = 1 and, therefore, 
fc = Z k . This would mean that fc itself is commutative. To give the 
proof we first need some facts about the so-called cyclotomic poly- 
nomials. 

It is well known that the polynomial x* 1 can be factored in 
the field of complex numbers: 

(LI?) *- - 1 - n (* - ) 



f-l-(q-l)+ 2; 
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wherfe ranges over the n-th roots of unity: c n = 1. If d is the precise 
order of e, then e is called a primalive d-th root of unity. Every d-th 
root of unity will appear among the n-th roots of unity if d is a 
divisor of n. Let us define 

*<(*) = II ( - ) 

where e shall range only over the primitive d-th roots of unity. 
Grouping the factors of (1.17) we find 

(i.i8) x- - 1 = n *x*> 

d|n 

where d ranges over all divisors of n. These polynomials 3> n (z) are 
the cyclotomic polynomials. The $ n (x) have obviously highest 
coefficient 1. We contend now that all coefficients of 3> n (x) are integers. 
Since &i(x) = x 1, this contention is true for n 1 and we may 
assume it is proved for all <f>j(x) with d < n. We know then that 
(1.18) has the form 

z n - 1 = *(*)/(*) 

where f(x) is the product of the factors ^j(x) with d < n and d\n. 
Therefore f(x) has integral coefficients and its highest coefficient is 1. 
The desired polynomial $ n (x) can, therefore, be obtained by dividing 
x n 1 by f(x). Remembering the way this quotient is computed 
and the fact that the highest coefficient of f(x) is 1, we see that 
& n (x) has integral coefficients. 

Let now d be a divisor of n but d < n. Then 

s - - i = n *,(*); 

aid 

each term $a(z) will appear as a factor on the right side of (1.18) 
since d\n. But d T* n] in the quotient (x n l)/(x d 1) the poly- 
nomial 3> n (z) will still be one of the factors. Thus we see that 3> n (x) 
divides x n I as well as (x n l)/(z d 1), 



. 
x 1 

and both f(x) and g(x) will have integral coefficients. If we set 
x = q, we see that the integer $(</) divides the two integers q n 1 
and (q n l)/(q* 1). With this information we turn to (1.16) 
and can conclude that the integer $ n (q) must divide q 1. 
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Now we estimate the size of the integer <((/). It is a product of 
terms q . The absolute value of q 6 is the distance of the point 
c on the unit circle arid the point q > 2 on the real axis. Each factor 
is certainly > 1 , even > q 1 , in absolute value and can be equal 
to q 1 only for = 1. This case e = 1 does not occur if n > 1 
since c is a primitive n-th root of unity. Thus certainly !$((?) I > q 1 
if n > 1. We see clearly that $(</) could not divide q 1. Thus we 
have proved the celebrated theorem of Wedderburn: 

THEOREM 1.14. Every field, with a finite number of elements is 
commutative. 

DEFINITION 1.13. Let / be a map of a field k into some field F 
which is one-to-one into and is a homomorphism for addition. 
If / satisfies 

(1.19) f(ab) = f(d)f(V) 

for all a, b e fc we call / an isomorphism of k into F. If / satisfies 

(1.20) f(ab) = 



for all a, b e k we call / an antiisomorphism of k into F. 

Let us remark that it suffices to assume that / is a homomorphism 
for addition, does not map all of k onto 0, and satisfies either (1.19) 
or (1.20). Indeed, if /(a) = for a single a 9* 0, then it would already 
follow that /(afc) 0. Since fc is a field, afc = fc, which contradicts 
our assumption. 

Hua has discovered a beautiful theorem which has a nice geometric 
application: 

THEOREM 1.15. // o- is a map of a field k into some field F which 
satisfies the following conditions: 

1) a is a homomorphism for addition j 

2) for a 7* we have 0(a~ l ) = (o-(a))" 1 ; i.e., we assume thai <r 
maps the inverse of an element onto the inverse of the image, 

3) er(l) = 1; 

then <r is either an isomorphism or an antiisomorphism of k into F. 

REMARK. Suppose a satisfies only conditions 1) and 2). Set 
a = 1 in 2), then x = er(l) satisfies x = of 1 , that is x 2 1 = 
(x l)(a? + 1) = 0. Thus, if o- does not satisfy condition 3), <r(l) 
= 1. If we put r(a) = tr(a), then T will satisfy all three conditions 
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which means that <r is either the negative of an isomorphism or the 
negative of an antiisomorphism. 

Proof of the theorem: Instead of a (a) we shall write a". Condition 2) 
reads now (a"" 1 )* = (a')" 1 for which we can write unambiguously 
a"'. Since a 7* Q implies a* -a"' = 1, we have a 9 j* which shows 
that the additive kernel of a is and our map, therefore, one-to-one 
into. 

We first establish an identity: 

Assume a, & t k; a, 6 ^ and a"" 1 7* b. Then the expression a~ l + 
(b' 1 a)"" 1 is well defined. Let us factor out a' 1 to the left and 
(IT 1 - a)" 1 to the right: 

a' 1 + (IT 1 - a)" 1 = a~ l ((b~ l - a) + a)(b~ l - a)- 1 

= a-V'OT 1 - a)" 1 . 
We can, therefore, take the inverse: 

(a" 1 + (b~ l - a)" 1 )' 1 = (b~ l - d)ba = a - aba. 
Thus we have 

a - (a~ l + (6" 1 - a)' 1 )- 1 = aba. 

If we apply a to the left side, conditions 1) and 2) allow us to 
interchange a each time with the operation of taking the inverse 
and we will end up with an expression like the one we started with, 
but a replaced by a 9 and 6 replaced by 6*. The left side becomes 
a'&'a' and we have shown that 

(1.21) (abaY = a'&V. 

(1.21) is also true if a or b are 0. If a"" 1 = 6, then ab = 1, the left 
side of (1.21) is a*. But a'Va* = a*a~ ff a* = a" which shows that 

(1.21) is true for all a, 6 e k. 
Set 6 = 1 in (1.21); 

(1.22) (a 2 )' = (aT- 
Replace a in (1.22) by a + b: 

(a* + ab + ba + 6')' = (a*)' + a'b' + b'a' + (&*)'. 
If we use (1.22) and condition 1) we finally obtain 

(1.23) (06)' + (ba)' = a'b' + b'a'. 
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Now comes the main trick. Let a, b 5^ and consider 

(1.24) ((ab)' - a'bO(abr'((ab)' - bV). 
Multiplying out (1.24) becomes equal to 

(1.25) (ab)' - bV - a'b' + a'b'(ab)~'b'a'. 

Use (1.21) on b'(ab)"'b' = (b(abr'b)', write a'(b'(ab)"b')a' - 



a'(b(dbr l bya' and use (1.21) again. We obtain 

(a-b(abr l b-ay = (ba)'. 
Thus (1.24) is equal to 

(ab)' - bV - a'b' + (ba)' 

which is zero by (1.23). Thus the product (1.24) is zero and one of 
the factors must vanish. 
We now know that 

Sa'b' 
or 
b'a' 

which is much nearer to what we want to show. For a or b = 0, 
(1.26) is trivially true. 

We ask whether the following situation could occur in k: can four 
elements a, b, c, d exist in k such that 

(ab)' = a'b' ^ bV, 
(1.27) 



We shall derive a contradiction from this assumption. 
Let x be any element in k and use (1.26) on a and b + x: 

(a'(b + x) 9 = a'b' + aV 
(1.28) (a(b + x)) 9 - < or 

((b + *) V = b'a' + V. 

The left side is (ab + ax)' = a'b' + (ax). If the first case of 
(1.28) happens we get (ax) 9 = a'x'. If the second case of (1.28) 
happens remember that a'b' 5^ b'a' so that certainly (ax) 9 7* x 9 a 9 . 
This means, by (1.26), that (ax) 9 = a'x'. We have, therefore, always 
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(ax)' = a'x'. The same method is used on the expressions ((a + x)b)', 
(c(d + x)) and ((c + x)d)'. Collecting all four cases the results are: 

(1.29) (oar)' = a'x', 

(1.30) (xbY = x'b', 

(1.31) (ex)' = x'c', 

(1.32) (xd)' = d'x'. 

Set x = d in (1.29) and x = a in (1.32); set also x = c in (1.30) 
and x = b in (1.31). We obtain 

(1.33) a'd' = d'a' and c'6" = 6V. 
Finally: 

(1.34) f (a + c)'(b + d)' = a'b' + a'd' + c'b' + c'f 
((a + c)(b + d)Y = < or 

((b + d)'(o + c)' = b'a' + d'a' + b'c' + dV. 
A direct computation of the left side gives: 

\ab) ~\~ \ad) ~\~ (co) ~\~ (ca) = a b ~{~ a d ~f* c b ~\~ d c . 



The first possibility on the right of (1.34) would give c'd ff = dV 
which contradicts (1.27). Using (1.33) for the second possibility we 
get aV = b ff a* and this contradicts (1.27) again. Our theorem is 
finally established; the fact that (1.27) can not happen obviously 
means that in (1.26) we either have the first possibility for all a, b e fc 
or else the second one for all a, 6 e k. 

9. Ordered fields 

DEFINITION 1.14. A field k is said to be ordered, if, first of all, 
it is ordered as an additive group. Rewriting Definition 1.11 in the 
additive notation (the third condition in 1.11 is not necessary since 
addition is commutative) this means that a set P of so-called "posi- 
tive" elements is singled out such that 

1) k = -P VJ {0} VJ P (disjoint), 

2) P + P C P (P is closed under addition). 

To these two additive conditions we add a multiplicative one: 

3) P-P C P (a product of positive elements is positive). 
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We define now a>6bya 6eP, mention again that a > is 
equivalent with a t P and, therefore, can use all consequences of 
Definition 1.11 of course in the additive form among them for 
example that a > b implies b > a. Let us now derive multipli- 
cative consequences. 

a) If c e P and a > 6, then a - b e P. Therefore, c(a - b) c P 
and (a 6)c e P, hence ca > cb and ac > be. If c e P, then 
c(a 6) e P, hence cb > ca and be > ac (inequalities are reversed 
ifO > c). 

b) If a > 0, then a- a > 0. If > a multiplication by a reverses 
the inequality, hence also in this case a - a > 0. Squares of non-zero 
elements are positive and especially 1 = I 2 > 0. Therefore > 1. 
We can write c" 1 c-(c~ 1 ) 2 and see that c > implies c" 1 > 0, 
> c implies > c~ l . 

Multiplying on the left by c j and on the right by c" 1 keeps 
the inequality: a > b implies cac" 1 > cbc~ l . The special cases 6 = 
and 6=1 show that cPc~ l C P and, if S is the set of elements 
a > 1, that cSc~ l C S. The set P is an invariant multiplicative 
subgroup of A;*; a > b is equivalent with ab~ l > 1, hence with 
ab~ l e S. If a E P and neither > 1 not = 1, then 1 > a. Multiplying 
by a" 1 gives a" 1 > 1 or a" 1 e S. 

The group P is the disjoint union S~ l ^J { 1 } VJ S and our ordering 
of k induces an ordering of the multiplicative group P. 

c) Since 1 e P, any sum of terms 1 is in P which shows that 
the characteristic of k must be 0. The ordinary positive integers 
are also positive in k. The ordering of A; induces, therefore, on the 
sub field Q of rational numbers the usual ordering. This proves also 
that the field of rational numbers can only be ordered in the usual 
way. 

In some geometric problems the ordering of a field arises in a 
quite different manner. We are led to another definition. 

DEFINITION 1.15. A field k is said to be weakly ordered if a 
binary relation a < b is defined in k such that the usual ordering 
properties hold. The connection to field operations presupposes, 
however, much weaker axioms: 

1) For any fixed a e k the ordering of k shall either be preserved 
or reversed under the map x x + a. (Notice that some a may 
preserve it, others may reverse it.) 

2) For any fixed a c ft* the ordering of k shall either be preserved 
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or reversed under the map x * xa. (Nothing is said about a map 
x ax.) 

If we say elements are in the arrangement a^ , o 2 , - , a n we mean 
that either a l < a 2 < < a n or a x > a 2 > > a . The axioms 
say that the maps preserve the arrangements. 

There is one freak case: the field with two elements 0, 1 has ob- 
viously only one arrangement so that any one-to-one map preserves 
it; the field can be weakly ordered. Suppose from now on that k 
has at least three elements. We prove certain facts about such a 
weakly ordered field. 

1) k can not have the characteristic 2. Let indeed 0, a, 6 be three 
distinct elements of k and suppose that A; is weakly ordered and has 
characteristic 2. 

a) Our elements are arranged as 0, a, 6. If we add a we get the 
arrangement a, 0, a + b and if we add b we get fc, a + &, 0. These 
two arrangements combined give a, 0, a + 6, 6 and this contradicts 
the original one. 

b) The arrangement is a, 0, b. Adding a and 6 we get 0, a, a + 6 
and a + fe, 6, whose combination gives either 0, a, b, a + b or 
0, 6, a, a + b contradicting the assumption. 

These two types exhaust the possibilities. 

2) fc has now a characteristic 7* 2. Let a e fc* and consider the 
possible arrangements of 0, a/2 and a/2. 

a) The arrangement is a/2, 0, a/2. Adding a/2 and a/2 we 
get the arrangements a, a/2, and 0, a/2, a which, together 
with the original one give 



b) The arrangement is 0, a/2, a/2. Adding a/2 and a/2 
gives a/2, a, and a/2, 0, a. Combined with the original one 
we get a, 0, a, a/2, a/2. (The case 0, a/2, a/2 is just a sign 
change in a.) 

Both these hypotheses lead to the arrangement 

a, 0, a 

which must, therefore, hold in any field k for any non-zero element. 
Adding a we also get 0, a, 2a. 

3) Suppose two elements are in the arrangement 
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0, a, b. 

Adding a we get a, 2a, a + 6. Combining it with 0, a, 2a we see that 
we also have 0, a, a + 6. Thus we have the following rule: If two 
elements a and b are on the same side of 0, then their sum is also on 
this same side. 

Denote now by P the set of all elements which are on the same 
side of as the element 1. Then we know that k is the disjoint union 

-PU {0} UP 

and that P + P C P. These are the first two axioms of an ordered 
field. It is worth while noticing that we used only condition 1) of 
Definition 1.15. 

Suppose a, b t P. Then a is on the same side of as 1. Multiplying 
by b we get that ab is on the same side of as 6. But this means 
ab*P. 

The field k is ordered. We see that the maps x > ax (for a j& 0) 
also either preserve or reverse the ordering. 

THEOREM 1.16. // a field k is weakly ordered and has more than 
two elements, then it is an ordered field. 

Hilbert has constructed an ordered non-commutative field and 
we shall present his example. 

Let F be any field and a an automorphism of F. We denote by 
a 9 the image of an element a t F under <7 and by a* its image under 
the automorphism or*. 

We construct an extension field k of F consisting of all formal 
power series 



in a variable t which contain only a finite number of non-zero co- 
efficients with a negative i; in other words we assume that for each 
power series an integer N can be found such that a, = for i < N. 
The addition in k is defined as usual but the multiplication is not 
the ordinary one. To multiply two power series one first multiplies 
termwise: 
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But now one does not assume that f can be interchanged with 6, ; 
instead of an ordinary interchange one uses the rule 

t*b = 6'V 
which for our product produces the series 

Z *,&;'"'. 

Finally one collects terms with the same power of /. The coefficient 
c r of will, therefore be 



It is a good exercise to make all this rigorous. We indicate the 
steps and leave the details to the reader. Since a power series is 
given by the coefficients we may just as well say that the elements 
of k are functions f(i) on the integers with value in F, thinking of 
the value f(i) as the coefficient of t\ We consider only functions / 
whose value f(i) = whenever i < N (some integer depending 
on /). The addition of functions is defined as usual but the product 
fg shall be the function whose value on i is given by 



One has to verify that the sum on the right side has only a finite 
number of non-zero terms and also that (fg)(i) = whenever 
i < N + M- N is the integer belonging to / and M the one belonging 
tog. 

It is easy to show that the two distributive laws and also the 
associative law hold. Since we wish to consider k as extension of F 
we have to map F isomorphically into k. The naive description by 
power series suggests how to do it. An element a e F is mapped onto 
the function a which satisfies d(0) = a and d(i) = for i ^ 0. One 
verifies^ that a > a is an isomorphism of F into the ring k. The 
image 1 of the unit element of F turns out to be the unit of k. If 
/ e k, then af is the function whose value at i is a-f(f) ) whereas fa 
has at i the value f(i)-a' % . 

One wishes also to get the power t m . To this effect one denotes 
by t m the function which has value 1 at m and j/alue at all other 
integers. Now one proves t m t n = / m+n and /<, = I, the unit element. 
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Therefore /7 = t m and abbreviating /! by / we see that we can write 

/ = r. 

Now one can prove 



One begins to drop the bar on top of a, identifying a with a. 

If / e fc, then// w is really a shift by m. One easily verifies the formulas 

(/O(0 = f(i - m) and (r/)(i) = (/(f - m)Y\ 

We wish to show that k is a field, i.e., that a non-zero element has 
an inverse. If / ?* 0, then by taking a suitable f* and a suitable 
a e F, the new function g = a// m will satisfy 0(0) = 1 and g(i) = 
for i < (its power series has no negative terms and begins with 1). 
If one can show that g has an inverse, then / will have the inverse 
t m g~ l a (since / = a~ l gt~ m ). One tries a hypothetical inverse h of 
the same type, &(0) = 1, h(i) = for i < 0. For /i(l), A(2), - one 
finds equations (starting with A(0) = 1) 

h(l)g(oy + fe(0) ff (l) = 0, 
h(l)g(iy + h(G)g(2) = 0, 
+ h(G)g(2) = 0, 



which allow successively to compute /i(l), /i(2), . The inverse 
will be a left inverse: f~ l f = 1. But mere group theory shows that 
then ff~ l = 1. Our ring fc is an extension field of F. 

At this point one can safely return to the naive power series 
description of fc. If / = 2^ t - a v t l e fc and if / ^ 0, then there exists 
an integer N such that a,- = if i < N but a# ^ 0. We shall call 
this a N the lowest term of /. If a N is the lowest term of / and b M 
the lowest term of g, then a N b M will be the lowest term of fg. 

It is now time to talk of the ordering of fc. We assume that F is an 
ordered field and try to define a set P of elements of fc which shall 
induce an ordering of fc. Let P consist of those / 9 whose lowest 
term a^ is positive in the ordering of F: a N > 0. The condition 
fc= PUJOJUP (disjoint) is obviously satisfied; the rule that 
the sum of two elements in P lies also in P is not difficult to verify. 
There remains the product rule. Here we obviously have to assume 
of our automorphism that a > implies a* > 0. Suppose that a has 
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this property; then a < will also imply that a* < 0. It follows 
that a > is implied by a ff > which shows that or" 1 has our property. 
We needed o-" 1 in order to be sure that all powers of o- have the same 
property. Now we can prove the product rule. If the lowest terms 
a N , respectively, b M of /, respectively, g are positive, then the 
lowest term a N b' M of fg will be positive. We see that fc is ordered. 

Is fc non-commutative? We have ta = a*t. As soon as o- is not the 
identity automorphism our field fc will be non-commutative. 

This reduces the construction problem a little. We must find a 
field F (commutative or not) which is ordered and which has an 
automorphism <r j* 1 such that <r preserves positivity. One can do 
this again by power series. Let Q be the field of rational numbers 
ordered in the usual fashion and F the field of power series in a 
variable x: 

]C <*&*> 


but this time with the ordinary multiplication of power series (hence 
with identity as automorphism). Let us order F by the lowest term; 
this is possible since the identity automorphism preserves the 
positivity in Q trivially. We must find an automorphism <r ?* 1 of 
F which preserves positivity. It is the substitution x 2x, or, more 
precisely: 

/= a,*'-r- Z<2V. 



To show that o- is an automorphism of F is a rather trivial exercise. 
That it preserves the sign of the lowest term is completely obvious. 
This finishes the construction of fc. 

THEOREM 1.17. There do exist ordered non-commutative fields. 

An ordered field has characteristic and contains, therefore, the 
field Q of rational numbers. Since the field Q can be ordered only in 
the usual way, the ordering of fc is in agreement with the usual 
ordering of Q. 

DEFINITION 1.16. An ordered field fc is said to be archimedean 
if for any a e fc one can find an integer n such that a < n. 

Let fc be archimedean and a e fc. There are integers n, m such that 
a < n and a < ra; then m < a < n. Every element of fc lies 
between integers. Consider now the set C of all rational numbers 
> a and the set D of all rational numbers < a. Neither C nor D 
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will be empty and these sets form a Dedekind cut in the rational 
numbers. Denote by /(a) the real number defined by this cut. We 
have a map / : k R where R is the field of real numbers and will 
show that / is one-to-one into. Let a 7* b, say 6 > a; we shall con- 
struct a rational number which lies strictly between a and 6 and 
thereby prove /(a) 7* f(b). There is an integer m > (b a)" 1 . 
Since b a > we obtain mb ma > 1. Among the integers n 
which satisfy n > ma there is a smallest one. For this n we have 
ma > n 1 > n (mb ma) = n mb + ma and get mb > n. 
The inequality 

mb > n > ma 

shows m > and b > n/m > a. We conclude f(b) > /(a). From 
the properties of Dedekind cuts one deduces that / is an isomorphism 
of k into R; f preserves the ordering as our previous discussion shows. 

THEOREM 1.18. An archimedean field is commutative, as a matter 
of fact, isomorphic to a subfield of the real numbers in its natural 
ordering. 

We saw that Q admits only one ordering. The field R of real numbers 
admits also only one ordering. Indeed squares of non-zero elements 
are positive in any ordering. Let a c R be positive in some ordering 
of R. Then a can not be a square which means that a > in the 
ordinary sense. Conversely if a > in the ordinary sense then a is 
a square and, therefore, positive in the given ordering. 

Let now cr be an automorphism of R] a carries squares into squares 
and preserves, therefore, the positivity in JR. If a < 6, then a* < V. 
The automorphism a- induces on Q the identity. Let a be a given 
element of 72. It is uniquely described by the Dedekind cut which it 
produces among the rational numbers. Since or preserves inequalities 
and leaves all rational numbers fixed, a 9 will have the same Dedekind 
cut as a. But this means a" = o. 

THEOREM 1.19. The field R of real numbers can be ordered only 
in the usual way. Every automorphism of R is the identity. 

10. Valuations 

DEFINITION 1.17. Let fc be a field and G an ordered multiplica- 
tive group with 0-element. We shall assume both k and G to be 
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commutative. A map of k into (?, denoted by a > |a| (read as "abso- 
lute value of a") is called a valuation of fc if it satisfies the following 
conditions: 

1) |a| = if and only if a = 0, 

2) |ob| = |a|.|b|, 

3) |a + b| < Max (|a|, |6|). 

REMARK. Condition 3) may be replaced by the rule 

3a) |a| < 1 implies 1 1 + a| < 1. 

Proof: Conditions 1) and 2) imply |1| = 1 so that 3) implies 3a). 

Suppose 3a) is true. Condition 3) certainly holds if a = b = 0. 
Otherwise assume that |a| < |6|. Then \a/b\ < 1 and 3a) implies 
|1 + a/b\ < 1. Multiplying both sides by |6| we get 3). 

All we shall really need are some examples of valuations. They 
are easily constructed by means of another notion. 

DEFINITION 1.18. Let k be a commutative field. A subring R 
is said to be a valuation ring if for every non-zero element of k 
either a or a" 1 lies in R. 

The elements of k for which both a and a" 1 are in R form a multi- 
plicative group C7, called the group of units. Those non-zero elements 
for which a~ l but not a is in R form a set S, and those for which a, 
but not a" 1 , is in R form the set S~ l consisting of the inverses of 
the elements in S. Clearly, 

k = {0} U S-'U UV S, 
and this union is disjoint. The ring R is: 

R = {0} VJ S' 1 U U. 

If a, 6 e R and ab t C7, then cfV 1 e R, hence a' 1 = 6- (a V 1 ) t R 
and b~ l = a(a~ l b~ l ) t R which shows a t U and b E U. This proves 
fiT 1 -*?- 1 C S' 1 and S' 1 U C S" 1 ; consequently 88 C Sand SC7 C S. 

The cosets of U form the set 



= {0} U S^/UV [U/U] U S/C7 

and this union is disjoint. k/U is a group with 0-element, S/U a 
semi-group. We may, therefore, consider k/U as an ordered group 
with 0-element. 

A valuation is now easily constructed: the map a > |a| shall 
merely be the canonical map of k onto k/U, in other words 
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|a| = aU. 

Conditions 1) and 2) are trivially satisfied. The inequality |a| < 1 
means aU is either in S~ l /U or in U/U = 1; this shows that |a| < 1 
is equivalent with a e R and implies trivially 1 + a e R and conse- 
quently |1 + a| < 1. 

The case R = k is not excluded. $ is then the empty set and 
k/ U has only two elements, and U/ U = 1 . We call this valuation 
the trivial valuation. 

Now our examples: 

1) Let k = Q the field of rational numbers and p a given 
fixed prime. Let R be the set of those elements of Q which can be 
written with a denominator relatively prime to p. R is obviously a 
valuation ring. U consists of those fractions whose numerator and 
denominator are prime to p. If a 5^ 0, then |a| = aU = p*U where 
p % is the contribution of p to a if we factor a into prime powers; 
|a| < 1 means a e R, hence i > 0. If |a| = p*U and |6| = p'U, then 
|a| < |&| implies |a/6| = p*~' U < 1 or i j > 0, i > j. An element 
of Q is "small" if it is divisible by a high power of p. 

This valuation of Q is called the p-adic valuation. 

2) Let k be an ordered field. An element of k is said to be finite 
if it lies between two integers; if it does not, we may call it infinite. 
One sees easily that the set of all finite elements forms a valuation 
ring R. The corresponding set S consists of the infinite elements of k, 
the set S~ l may be called the set of infinitely small elements; |a| < 1 
means now that a is finite. 

This valuation will be trivial if and only if k is archimedean. If k 
is not archimedean, then | a \ gives a classification of the infinitely 
large or small elements "according to size". 



CHAPTER II 
Affine and Projective Geometry 

1. Introduction and the first three axioms 

We are all familiar with analytic geometry where a point in a 
plane is described by a pair (x, y) of real numbers, a straight line by 
a linear, a conic by a quadratic equation. Analytic geometry enables 
us to reduce any elementary geometric problem to a mere algebraic 
one. The intersection of a straight line and a circle suggests, however, 
enlarging the system by introducing a new plane whose points are 
pairs of complex numbers. An obvious generalisation of this procedure 
is the following. Let k be a given field; construct a plane whose 
"points" are the pairs (x, y) of elements of k and define lines by 
linear equations. It is of course comparatively easy to work out the 
geometric theorems which would hold in such a geometry. 

A much more fascinating problem is, however, the converse. 
Given a plane geometry whose objects are the elements of two sets, 
the set of points and the set of lines; assume that certain axioms of 
geometric nature are true. Is it possible to find a field k such that the 
points of our geometry can be described by coordinates from A; 
arid the lines by linear equations? 

We shall first state the needed axioms in a loose fashion. The 
first two are the incidence axioms: there is exactly one line through 
two distinct points and exactly one parallel to a given line through 
a given point. The third axiom eliminates freak cases of geometries 
which are easily enumerated and merely assures us of the existence 
of enough points and lines. For a while we shall work only with these 
three axioms and prepare the ground for the fourth (and last) which 
is the most interesting one. 

One introduces certain symmetries of the geometry called dilata- 
tions. They are (except for freaks) one-to-one maps of the plane onto 
itself which move all points of a line into points of a parallel line. 

The identity map is such a dilatation and with only our three 
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axioms at ones disposal, one should not expect the geometry to 
have any others. This is where the fourth axiom comes into play. 
It is split into two parts, 4a and 4b. Axiom 4a postulates the existence 
of a translation (a special kind of dilatation) which moves a given 
point into any other given point. With this axiom 4a alone one can 
already construct a certain field fc. Axiom 4b shall now ensure the 
existence of enough dilatations of the remaining kind (which resemble 
the transformations of similarity in the euclidean plane). Using 
axiom 4b one can show that the points can be described by coordinates 
from k and the lines by linear equations. 

One can, therefore, conclude that one can coordinatize any geo- 
metry which contains enough symmetries. 

We shall now start with a precise formulation of the axioms. 

We are given two sets: a set of "points" and a set of "lines". We 
are also given a basic binary relation between a given point P and 
a given line I: "P lies on V 9 (which may or may not be true for the 
given pair P and I). All the axioms can be expressed in terms of 
this one basic relation. However it is clear that the language would 
become intolerably clumsy if one would actually do this. We shall, 
therefore, use obvious synonyms for the binary relation as "P is on 
I" or "I contains P" or "I goes through P". If P lies on both I and 
m we may say that I and m meet in P and if P is the only point on 
both I and m, then we say that "Z and m intersect in P" or that 
"P is the intersection of I and m". At the beginning we shall be a 
little stricter in our language until the reader has learned to replace 
the synonyms by the basic relation, but we will soon relax. 

For a similar reason we did not draw figures in the first paragraphs 
of this chapter. The reader will, nevertheless, get a better under- 
standing of the proofs if he makes simple sketches for himself. 

Our first task is now to define what is meant by "parallelism". 

DEFINITION 2.1. If Z and m are two lines such that either I = m 
or that no point P lies on both I and m, then we call I and m parallel 
and write Z || m. If Z and m are not parallel, we write Z Jf m. 

If Z Jf m, then there is at least one point P which lies on both 
Z and m. 

AXIOM 1. Given two distinct points P and Q, there exists a unique 
line Z such that P lies on Z and Q lies on 1. We write Z = P + Q. 
If Z Jf m, then there is exactly one point P which lies on both I and m. 
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Indeed, if there were two such points, then, by axiom 1, I = m, 
hence I \\ m. 

AXIOM 2. Given a point P and a line I, there exists one and only 
one line m such that P lies on m and such that m \\ I. 

THEOREM 2.1. "Parallel" is an equivalence relation. 

Proof: It is obviously reflexive and symmetric. To prove the 
transitivity assume l^ \\1 2 and 1 2 || J 3 . If there is no point on both 
li and Z 3 , then li \\ 1 A . If there is a point P on both ^ and 1 3 , then 
by axiom 2 (since / t || 1 2 and 1 3 \\ 1 2 ) we have l v = 1 3 , hence again 



DEFINITION 2.2. An equivalence class of parallel lines is called 
a pencil of parallel lines. 

THEOREM 2.2. Suppose that there exist three distinct pencils, 
wi , ir 2 and 7T 3 of parallel lines. Then any pencil w contains the same 
number of lines and this number is equal to the number of points on 
any line. 

Proof: Let I be a line of ir l , m a line of ir 2 . We have / Jf m so that 
there exists exactly one point P of Z which also lies on m. On the other 
hand let Q be any point of I. There is exactly one line m' \\ m, hence 
one line of 7r 2 such that Q lies on m ; . We have, therefore, a one-to-one 
correspondence between the points of I and the lines of 7r 2 : the number 
of points of / is the same as the number of lines of w 2 . Thus far we 
have shown: If two distinct pencils are given, then each line of one 
pencil contains as many points as there are lines in the other pencil. 
If TT is any pencil, then it is certainly distinct from at least two of 
our three pencils, say IT j KI and TT j ir 2 The number of points on 
a line of TTI is equal to the number of lines in ?r and equal to the 
number of lines in ir 2 . The theorem follows easily. 

AXIOM 3. There exist three distinct points A, B, C such that C 
does not lie on the line A + #. We also say that there exist three non- 
collinear points. 

The lines A + B and A + C are not parallel: otherwise (containing 
the point A) they would be equal and C would lie on A + JB. For 
the same reason A + B Jf B + C and A + C Jf B + C. There do 
exist now at least three distinct pencils of parallel lines and Theo- 
rem 2.2 applies. 
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EXERCISE 1. Enumerate carefully (not being prejudiced at all 
about existence of points or lines or points on lines) all cases where 
the first two axioms hold but where axiom 3 is violated. 

EXERCISE 2. Find the geometry with the least possible number 
of points in which all three axioms hold. Show that its structure is 
unique. 

2. Dilatations and translations 

In modern mathematics the investigation of the symmetries of a 
given mathematical structure has always yielded the most powerful 
results. Symmetries are maps which preserve certain properties. 
In our case they will be maps preserving "direction". 

DEFINITION 2.3. A map o- of points into points is called a dilata- 
tion if it has the following property: 

Let two distinct points P and Q and their images P 1 and Q' be 
given. If I' is the line || P + Q which passes through P', then Q' lies 
onZ'. 

We can give two examples. 

1) Map all points onto one given point. We shall call this a de- 
generate dilatation and any other dilatation non-degenerate. 

2) The identity map. 

We can now immediately prove 

THEOREM 2.3. A dilatation a is uniquely determined by the images 
P', Q' of two distinct points P and Q. Should P' = Q', then a is degen- 
erate and all points are mapped into P'. Should P' 7^ Q', then a is 
one-to-one and onto (every point is an image). 

Proof: 

1) Let R be any point which is not onP + Q. Then R + P Jf R + Q. 
Let V be the line through P 1 which is || R + P. By definition of a 
dilatation R' must lie on I 1 . Similarly, R' will lie on the line I" which 
is || R + Q and passes through Q'. The lines V and I" are not parallel 
and contain R f . We see that R f is uniquely determined. 

2) Let now R be a point of P + Q, say R j* P. Select any point 
S not on P + Q. The point R is not on P + S; otherwise R and P 
would lie on P + Q and P + S, hence P + Q = P + S, contradicting 
the choice of S. The image S' of S is determined by 1); we know the 
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images P f and S' of P and /S, and R is not on P + S. By 1) the 
image of R is determined. 

3) Suppose P 1 = Q'. The degenerate map r which maps every 
point into P' has the same effect as <r on P and Q. By the uniqueness 
which we have already shown a = r. 

4) Suppose P' 9 Q' and let R' be a given point. We have to show 
that there exists a point R whose image is R'. Suppose first that R' 
does not lie on P' + Q 1 . Then P f + R' | Q' + R'. Let I, be the 
line || P' + R f which passes through P and 1 2 the line || Q' + R' 
which passes through Q. Then ^ Jf 1 2 . There is a point R which lies 
on both /! and 1 2 . If R would lie on P + Q, then P + Q would meet 
li in P and R and 1 2 in Q and R so that P + Q must equal one of the 
lines /! or Z 2 . But P' + Q' is neither || to P' + R' nor to Q' + R'. It 
follows now from 1) that R has R' as image. If /' lies on P r + Q' 
we first select a point 8' which is not on P 1 + Q', find the point S 
whose image AS' is, and argue with P' and S' instead of P' and Q'. 
This finishes the proof of the theorem. 

As an immediate consequence we have the 

COROLLARY. If the dilatation o- has two fixed points, then a = 1, 
er is the identity map. 

DEFINITION 2.4. Let a be a non-degenerate dilatation, P a point. 
Any line containing P and vP shall be called a trace of P. If P ^ o-P, 
then the trace is unique and = P + <rP. 

THEOREM 2.4. Let <r be a non-degenerate dilatation, P a point and 
I a trace of P. If Q lies on I, then <rQ lies on I also. 

Proof: We can assume Q 7* P so that Z = P + Q. Then aQ ^ aP 
and the line <rP + aQ \\ I by definition of a dilatation. But I and 
o-P + vQ have the point <rP in common and are, therefore, equal. 
This shows that aQ lies on I. 

Theorem 2.4 has also an immediate 

COROLLARY. The intersection of two non-parallel traces is a 
fixed point. 
The possibilities for the traces of a non-singular dilatation a are: 

1) All lines, if and only if or = 1. 

2) All lines through a point P if cr 7* 1 and if P is a fixed point of o-. 
Indeed, any line through P is a trace and no other line could be a 
trace since <r j& 1 can not have more than one fixed point. 
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3) A certain pencil of parallel lines if or has no fixed point. 
This exhausts all possibilities and suggests a definition. 

DEFINITION 2.5. A non-singular dilatation T shall be called a 
translation if either r = 1 or if r has no fixed point. If r is a trans- 
lation j& 1, then the traces of r form a pencil w of parallel lines which 
we shall call the "direction" of T. 

THEOREM 2.5. A translation r is uniquely determined by the image 
of one point P. (Notice that we do not claim the existence of trans- 
lations with pre-assigned image point.) 

Proof: Let / be a trace of r which contains P. Any line parallel 
to / is also a trace of r; let V \\ l,l'j*l and Q on I'. Then rQ must 
lie on I'. Put m = P + Q and let ra' be parallel to m and contain rP. 
By definition of a dilatation rQ must also lie on m'. rQ lies on both 
V and m' and I 1 Jf m' since I Jf m. Thus rQ is uniquely determined. 
But a dilation is uniquely determined by the image of two points. 

We shall assume from now on that dilatations are non-singular 
unless we explicitly state the opposite. The definition of a dilatation 
can then be simplified to: 

Whenever P ^ Q we have <rP ^ <rQ and P + Q \\ aP + o-Q. 

Let <TI , o- 2 be two dilatations. We can form the combined map 
o-i (0*2 (P)) which we denote by <ri<r 2 . If P ^ Q, then <r 2 P 5* <r 2 Q hence 
and 



P + Q 

The map o-i<r 2 is again a dilatation. 

Since a dilatation <r is one-to-one and onto we can form the inverse 
map which will also be one-to-one and onto. If P 9* Q then 
<r~ l P 7* v~ l Q and <r~ l P + (T'Q || <r(cr" l P) + a(<r~ l Q) = P + Q 
which shows o-' 1 to be a dilatation. The dilatations form, therefore, 
a group. 

If r is a translation, then r" 1 is also a translation. Suppose indeed 
that r~ l has a fixed point P. Then r~ l P = P. Applying r to it we 
get P = rP which is only possible for the translation r = 1 in which 
case r"" 1 = 1 is also a translation. 

Assume r Y and r 2 are translations and suppose that r^r 2 has a 
fixed point: r^P = P. Then r 2 P = ri l P. Thus r 2 and TI I have 
the same effect on P which means that they are equal: r 2 = ri" 1 . 
Then r^r 2 = 1 which is a translation and we see that r^ 2 is a trans- 
lation in any case. The translations form, therefore, a group. 
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Let a be a dilatation and r a translation. We wish to show that 
<TT<T~ I = TI is again a translation. If r l has a fixed point P, then 
<TT(r~ 1 P = P which implies r<r~ l P = <r~ l P, showing that <r~ l P is a 
fixed point of r. Hence r = 1, TI = 1 which is also a translation. 
Suppose now that T 7* I and that the pencil TT is the direction of T. 
<r~ l P + T0~ 1 P is the r-trace of a~ l P and is, therefore, a line of TT. 
Since er is a dilatation, cr" l P + TcT'P || cr(er~ l P) + CT(T<F~ I P) = 
P + oTflr~ 1 P. The line P + <rr<r~ l P is also in TT and is a oro^-trace of 
P. This shows that oro-" 1 has also the direction TT. Let us gather all 
these results: 

THEOREM 2.6. The dilatations form a group D and the translations 
form an invariant sub-group T of D. If a- is a dilatation and T 5* 1 a 
translation, then T and oro-" 1 have the same direction. 

THEOREM 2.7. Identity and the translations having a given pencil 
TT of parallel lines as direction form a group. 

Proof: If T 5* 1, then P + rP = rP + T -I (rP) is a r-trace of P 
and a retrace of rP. 

If TI 7^ 1 and T 2 T^ 1 have the direction ?r, then P + T 2 P lies in v 
and contains T 2 P. It contains, therefore, also TiT 2 P, by Theorem 2.4. 
If TiT 2 P = P, then T!T 2 = 1; if T^P ^ P, then P + T 2 P = P + 
TiT 2 P which is a trace of 



THEOREM 2.8. // translations with different directions exist, then 
T is a commutative group. 
Proof: 

1) Suppose TI and T 2 have different directions. By Theorem 2.6 
the translation T^T^ has the same direction as T 2 , hence the same 
direction as Ti" 1 . If T^T^T^ 9* 1, then T^^rVi" 1 has the same 
direction as T 2 . But TI and TaTrVi" 1 have the same direction as TI 
so that TiT 2 Ti l T* l has also the direction TI . This is a contradiction 
and consequently r^r 2 ri l r^ 1 = 1 which shows TiT 2 = T 2 Ti . 

2) Suppose TI and T 2 have the same direction. By assumption 
there exists a translation T S whose direction is different from that of 
TI and T 2 . Therefore T S TI = TiT 3 . The translation T 2 T 3 must also 
have a direction different from that of TI ; otherwise T^-T^ = T 3 
would have the same direction. We know, therefore, also that 

Ti'(T 2 T 3 ) = (T 2 T 3 )-Tx 

From TiT a T 3 = T 2 TiT 3 we get now 
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REMARK. It is conceivable that the geometry contains only 
translations with a single direction IT (aside from identity). It is an 
undecided question whether T has to be commutative in this case. 
Probably there do exist counter-examples, but none is known. 

3. Construction of the field 

We can hope for a "good" geometry only if the geometry has 
enough symmetries. We postulate, therefore, 

AXIOM 4a. Given any two points P and Q, there exists a trans- 
lation TPQ which moves P into Q: 

T PQ (P) = Q. 

REMARK. Since the image of one point determines a translation, 
TPQ is unique; especially r PP = 1. Since we now obviously have 
translations with different directions, our group T of translations 
will be commutative. The geometric meaning of axiom 4a is obvious, 
another geometric interpretation shall be given later. 

We shall define new maps which will send T into T. 

DEFINITION 2.6. A map a : T > T shall be called a "trace 
preserving homomorphism" if: 

1) It is a homomorphism of T, meaning by this that 



( 

(We use the symbol r a for the image of T under the map a. for 
convenience.) 

2) It preserves traces, or, more precisely: the traces of r are among 
the traces of r a . This means that either r a = 1, or that r and T" 
have the same direction. 

Important examples of trace-preserving homomorphisms are the 
following. 

a) Map every translation T of T onto identity: r a = 1 for all 
r t T. Clearly both conditions are satisfied. We denote this very 
special map by since r = 1 is very suggestive. 

b) The identity which will be denoted by 1. Thus r 1 = r for all 

TtT. 

c) Map each r onto its inverse r" 1 . We denote this map of course 
by T~ I . We have 
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( TlT2 )- = T JV = rrV 

since T is commutative; r "* arid r have the same traces. 

d) Let <7 he a fixed dilatation. Map each r onto orcr" 1 . We know 
already that r and error" 1 have the same traces. Furthermore 

/ \ -i -i -i 

<r(T\T2j<r = <TTi<r <rr 2 <r 

which shows condition 1. We do not give a special name to this map. 
The set of all trace-preserving homomorphisms shall be denoted 
by ft. 

DEFINITION 2.7. Let a, ft be elements of fc. We may construct a 
new map which sends r into r " - T P and shall call this map a + ft. Thus 

T+' = T V. 

We may also send r into (r^) a and call this map a- ft. Therefore 



THEOREM 2.9. // a and ft belong to k, then a + ft and aft belong to k. 
Under this definition the set k becomes an associative ring with unit 
element 1. 

Proof: 

1) ( Tl r 2 ) af/3 = (r^r (r,r 2 Y (by definition of a + ft) 

= r\r 2T?r? (since a, ft e k) 

= r"r?r "r? (since T is commutative). 
Hence 

( 
T 



If r = 1, then r a = / = 1, hence r a ^ = 1. 

If T j 1, then its direction TT occurs among the traces of r a and 
r" and, therefore, also among the traces of r tt -r ft = T a+ft . This shows 
a + ft e fc. 

2) 






The traces of T are among those of r f , hence among those of 
(/)" = T''. 
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3) If a, j9, y E fc, we show (a + 0) + 7 = <* + (P + r): 

<a+/J) + Y __ +i T _ ^T __ --.0 + T + <A+T> 

I " T T ' Til ~~~* T 7 """" I 

(we used the associativity of 3F). 

4) a + 13 = + *: 

r +/? == r r ^ = T /J T (since Tis commutative) 

= T ft+a 

5) + a = a: 

T 0+ = T . r a = l. r = T a . 

6) a + (-l)a0: 
T .+c-i> = T -. T (-n. = T .( T )- = 1 = r. 

This shows that k is a commutative group under addition. 

7) (0 + 7)a = Pot + T: 



Notice that only definitions of addition and multiplication are used. 
8) o(j8 + 7) = o0 + 07: 



Here we had to use that a is a homomorphism. 
9) 



10) l-o = a and ! = a: 

T 1 - ( T a ) 1 - T* and T"' 1 = (r 1 )" = r". 

This establishes all axioms for a ring. 

In our next theorem we use axiom 4a to the full extent. 

THEOREM 2.10. Let a e fc, a ^ and P be a given point. There 
exists a unique dilatation a- which has P as fixed point and such that 



CHAPTER II 61 

- 1 for all r c T. 



Proof: 

1) Suppose such a <r exists. Let Q be any point and select for T 
the translation r PQ . Then 

a 1 

TPQ = (TTpgCr . 

Apply this to the point P remembering that cr leaves P fixed: 

TP (P) = ^^(P) = C7T PO (P) = <KQ). 

Thus 

(2.1) <KQ) = r* PQ (P). 

This shows how to compute the image of a given point Q under 0: 
One takes r PO , computes TPQ and applies this to P. This proves 
not only the uniqueness but shows us also which map <r has to be 
tried in order to prove our theorem. 

2) Let now a t k and allow even a = 0. Define a map a on the 
points Q by formula (2.1). We contend that a is a dilatation, but 
possibly degenerate. 

Let Q and R be two distinct points. The translation T QR T PQ moves 
P into R since r QR T PQ (P) = r QR (Q) = 72. Therefore 



TQRT PQ 

If we apply a to this equation, we get 



Applying both sides to the point P we obtain 



hence, by formula (2.1), 
(2.2) 

Let i be a line through <r(Q) which is parallel to Q + 72; Z is a trace 
of TO* and hence a trace of TQ R . cr(Q) lies on this trace of TQ R ; thus 
its image <r(R) must also lie on I. But this is the condition <r must 
satisfy to be a dilatation. By (2.1) we have <r(P) = rp>(P) = 1 "(P) = 
1(P) = P so that a has P as fixed point. If <r is degenerate, then 
every point is mapped into P and (2.1) gives P = r?o(P); hence 
T PQ = 1 for all Q. Any translation r has the form r PQ , namely for 
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Q = , r (p) m Therefore r a = 1 f or all r e T which means a = 0. If 
a T* 0, then cr is not degenerate. 

Let now a ?& 0. Since we know <r(P) = P, we can write (2.1) in 
the form 



or 

Q = <r l T ; Q <r(P). 

This means that the translation o > ~ 1 (rp )((r" 1 )~ 1 moves P into 
and is consequently r PO . Hence 



-1 a 

<r 



or 



We have already mentioned that r PQ can be any translation r, 
and have, therefore, 

T* = <r T( r- 1 for all T e T. 
But this is the contention of our theorem. 

REMARK. Example d) of the trace-preserving homomorphisms 
shows the converse. If a- is a dilatation, then r * arcr" 1 is a trace- 
preserving homomorphism a; it can not be since orcr"" 1 = 1 holds 
only f or T = 1 and not for all r. We see, therefore, that example d) 
has already exhausted all possibilities for a trace-preserving homo- 
morphism. To a given a j we can find a <r and even prescribe the 
fixed point for <r. 

THEOREM 2.11. k is afield (possibly not commutative under multi- 
plication). 

Proof: Let atk and a 5^ 0. Find a dilatation <r such that r a = oro-"" 1 . 
The map which sends r into o^Vcr = (r~ l r(<r~ l )~ l is also an element 
of k which we shall call a" 1 . Thus 

r a = ffTff" 1 and r a * = <r~ l r<r. 
Now, 

er'a"" 1 / a~' I \a / 1 \ 1 1 

r = (T ) = 0(0 T<r)<r = r = r 



CHAPTER II 63 

and 

a~ l a / axa" 1 I/ ~1\ 1 

r = (T ) = <7 (<TT<? )<j = r = r 

which shows aa" 1 = af^a = 1 and establishes the existence of an 
inverse (its uniqueness follows from group theory, provided the 
existence is established). 

THEOREM 2.12. // r a = 1 holds for one a and one T, then either 
a = or r = 1. If r a = r ft holds for particular a, ft, r, then either 
T = 1 or a = ft. 

Proof: Suppose r" = 1 and a 7* 0. Apply a~ l : 

( T )- 1 = I"' 1 = l or r = 1. 

Suppose now r a = T**. Multiplying by r~ ft we have r a ~ ft = 1. If 
a 5* 0, then r = 1. 

(The element is of course ( 1) ft and also /3( l), since 
ft + ft-(-l) = 0(1 + (-1)) = ft-Q = 0. The rule 0-0 = holds in 
every distributive ring but is also checked directly: r"' = (r / = 



4. Introduction of coordinates 

Our geometry is still not "symmetric" enough. We shall need one 
more axiom which we shall put into two forms; the equivalence of 
the two forms will be a separate theorem. 

AXIOM 4b. If T! and r 2 are translations with the same traces and 
if T! 7^ 1, r 2 T^ 1, T! 3^ r 2 , then there exists an a in fc such that 

a 

r 2 = r i . 

REMARK. If 73 = 7!, then r 2 = rl ; if r 2 = 1, then r 2 = T? . 
Thus only TI 3^ 1 is needed, but we wanted to state the axiom in 
its weakest form. Theorem 2.12 shows that the a is unique. 

Now we state the other form of the axiom. 

AXIOM 4b P (for a given point P). Given two points Q and R 
such that P, Q, R are distinct but lie on a line; then there exists a 
dilatation o- which has P as fixed point and moves Q into R. 

THEOREM 2.13. // axiom 4b P holds for one point P, then axiom 4b 
is true, and axiom 4b implies axiom 4b P for all points P. 
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Proof: 

1) Assume axiom 4b P true for a particular P. Let TI and r 2 be 
translations satisfying the hypothesis of axiom 4b and set ?J? = Q 
and T 2 P = R. Since r l and r 2 have the same traces, the points P, Q, R 
are collinear and are distinct by the assumptions on TI and r 2 . 
Let <r be the dilatation with fixed point P such that a(Q) = R. 

1 is a translation and a > r 1 o-~ 1 (P) = 0Ti(P) = a(Q) = #. Hence 
1 = r 2 . Let a e k such that r a = ow" 1 . Then r 2 = r" . 

2) Assume that axiom 4b is true and let P, Q, 72 be three distinct 
collinear points. Put r = r PO , r 2 = r PR . Since P, Q, J? are collinear, 
T! and r 2 have the same traces, TI 3^ 1, r 2 ^ 1 and TX 5^ r 2 . By 
axiom 4b there exists an a 9* such that r 2 = r" . By Theorem 2.10 
there exists a dilatation with fixed point P such that r a = orcr" 1 
for all r e T, especially 

T 2 = <TTi<T~ , T 2 (T = (TTi . 

If we apply this to P we get r 2 (P) = ori(P) or R = er(Q); i.e., o- is 
the desired dilatation. 

The geometric meaning of axiom 4b P is clear and another one 
shall be given later. 

THEOREM 2.14. Let TI 7* 1, r 2 7* 1 be translations with different 
directions. To any translation T e T there exist unique elements a, ft 
in k such that 

T = r irj = rjr i . 

Proof: The commutativity is clear since T is commutative. 

1) Let P be any point and suppose r(P) = Q. Let li be a retrace 
through P and Z 2 a retrace through Q. Then Z t Jf / 2 so that there is 
a point R on /i and / 2 . 

The translation r PR is either 1 or has the same direction as TI . 
The translation r RQ is also either 1 or has the same direction as r 2 . 
By axiom 4b and the remark there do exist elements a and ft in k 
such that T PR = r" and T RQ = r 2 . Then r 2 r1(P) = T RQ r PR (P) = Q 
and hence r/ = r 2 r" . But r PO is our r. 

2) If T T{ = r \r\ , then rf" 7 = rj" 1 . If r?" 7 5^ 1, then the left 
side of Ti~ y = r 2 " /s has the direction of r l , the right side the direction 
of T 2 which is not possible. Thus rj*" 7 = 1 and r 2 "^ = 1 . Theorem 2.12 
shows a = 7 and = 6. 
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We are now ready to introduce coordinates. 

In ordinary geometry one describes them by selecting an origin 0, 
by drawing two coordinate axis and by marking "unit points" on 
each of the axis. 

We shall do precisely the same. We select a point as "origin" 
and two translations TI ^ 1 and r 2 j* 1 with different traces. The 
naive meaning of TI and r 2 is the following: The retrace through 
and the r 2 -trace through shall be thought of as the coordinate 
axis and the two points ri(0) and r 2 (0) as the "unit points". 

Let now P be any point. Write the translation r p in the form 



with the unique , 17 1 k and assign to P the pair ( , 77) as coordinates. 
If conversely (, 77) is a given pair, then T\T\ is a translation, it takes 
into a certain point P so that TOP = rirj , and this point P will 
have the given coordinates (, 77). We write P = (f, 77). 

For P = we get TOO = 1 = T\T^ so that the origin has coordinates 
(0, 0). 

For (1, 0) we have rlrj = TI = TOP so that r^O) is indeed the 
"unit point" on the first axis and similarly r 2 (0) the unit point on 
the second axis. 

Let now I be any line, P = (a, 0) a point on I and r = r \T\ a trans- 
lation 7* 1 which has / as trace. Let Q = (, 77) be any point on I. 
Then T PQ is either 1 or has the direction of r. By axiom 4b and the 
remark we have 

TPQ = T 

with some t e fc. 

Conversely, for any /, Z will appear among the traces of T' ; / (P) = Q 
is some point on i which shows that every r* is some T PQ with Q on I. 
We also know that t is uniquely determined by T PQ (Theorem 2.12). 

To get the coordinates of Q we must compute TOO We have 

(Y *\< a $ _ ty + a tt+ft 
T i T 2 ; T i T 2 T! T 2 

Thus Q = (fy + a, J5 + 0) will range over the points of I if t 
ranges over fc. How much freedom can we have in the choices of 
a, 0, 7, 5 and still be certain that'Q = (ty + a, t8 + 0) describes the 
points of a line? 

Since P = T*IT* can be any point, a, /} are arbitrary; T = T>J had 
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to be a translation j* 1 which means that y and 8 are not both 0. 
If this is the case, then the line through P which is a r-trace is the 
line that our formula describes. 

We may abbreviate in vectorial notation: 
We put A = (7, d) and P = (a, 0). Then 



t ranging over fc, gives the points of a line. A j is the only restric- 
tion. 

The reader may be more familiar with another form of the equation 
of a straight line: 

Put Q = (x, y). Then x = ty + a, y = / + ft. 

We consider two cases: 

a) 7 7^ 0. Then / = xy~ l ay~ l and consequently y = xy" 1 8 + ft 

ay~ 1 d. This is the form 

i 

y = xm + 6. 

Converse, if y = xm + b y put x = /, 2/ = tm + b and we have a 
"parametric form" with y = 1, 8 = m, a = 0, = b. 

b) 7 = 0. Then 5 5^ so that y is arbitrary and x = a. 

We have certainly recovered the usual description of points and 
lines by coordinates. 

5. Ajfine geometry based on a given field 

We shall start now at the other end. Suppose that k is a given 
field and let us build up an affine geometry based on k. 

We define a point P to be an ordered pair (, 77) of elements , rj e k. 
We shall identify P also with the vector ( , rj) and use vector notation. 
A line Z will be defined as the point set P + tA where P is a given 
point, A a given non-zero vector and where / ranges over k. The 
relation that a point Q lies on a line I shall merely mean that Q is an 
element of the point set we called L Our problem is now to prove 
that this geometry satisfies all our axioms. 

Let us first consider the possible intersections of a line P + tA 
and Q + uB where A and B are non-zero vectors and where / and 
u range over k. For a common point we must have 

P + tA = Q + uB 
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and we must find the / and u satisfying this equation which we 
shall now write in the form 

(2.3) tA - uB = Q - P. 

Case 1. A and B are left linearly independent an equation 
xA + yB = always implies that x = y = 0. Then any vector can 
be expressed uniquely in the form xA + yB and (2.3) merely requires 
that Q P be expressed in this form. We see that there is exactly one 
/ and u solving (2.3) and this means that the intersection is exactly 
one point. 

Case 2. A and B are left linearly dependent elements a, ft in k 
exist which are not both zero such that aA + &B = 0. Then a j 0; 
for, otherwise pB = 0, hence /3 = 0, since we have assumed B 9* 0. 
Similarly ft j* 0. Then = ?A where 7 = -/T 1 * ^ 0. 

We can then simplify the line Q + uB = Q + uyA =* Q + vA 
with y = uy-j if w ranges over &, so will v. We may assume, therefore, 
B = A and (2.3) becomes 

(2.4) (t - t*)A = Q - P. 

If Q P is not a left multiple of A, we can not possibly solve this 
equation the two lines do not intersect, they are parallel. If 
Q P = d A is a left multiple of A, then Q = P + 8 A and the line 
Q + uA becomes P + (u + $)A. If u ranges over A*, so does u + d 
and we see that the two lines are equal and, therefore, parallel again. 

We see now: two lines P + tA and Q + uB are parallel if and only 
if B = yA. A pencil of parallel lines consists of the lines P + tA for 
a fixed A if we let P range over all points. 

Our analysis has also shown that there can be at most one line 
containing two distinct points. Let now P ^ Q be two distinct 
points. Then Q - P ^ so that P + t(Q - P) is a line. For t = 
we get P, f or t = 1 we get Q as points on this line. Thus axiom 1 is 
verified. 

If P + tA is a given line and Q a given point, then Q + uA is a 
parallel line and contains Q (for u = 0). Our discussion shows the 
uniqueness. This is axiom 2. 

The points A = (0, 0), B = (1, 0), C = (0, 1) are not collinear: 
the line A + t(B A) = (/, 0) contains A and B but not C. Axiom 3 
holds. 

Consider next the following map a: select a e k and any vector C. 
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Put 

(2.5) ' aX = ctX + C. 

If a = 0, then every point is mapped onto C and a is a dilatation. 

Let a 7* 0. The image of the line P + tA is a(P + tA) + C = 
aP + C + atA = (aP + C) + uA since u = at will range over fc 
if J does. 

The image of the line P + tA is the line (aP + C) + uA which is 
parallel to P + tA. Thus a is a dilatation and a non-degenerate one, 
since a line goes into a line. 

The fixed points of a- must satisfy 

X = ctX + C, (1 - a)X = C. 

We have two cases: 

1) a 5* 1; then (1 a)~ l C is the only fixed point. 

2) a = 1; if C T* 0, there is no fixed point, if C = 0, then <r = 1, 
the identity. 

We see that the translations among our maps are of the form 
X X + C. Let us denote this translation by T C . Obviously 
T Q - P (P) = P + (Q P) = Q which verifies axiom 4a and shows 
that the translations r c are all translations of our geometry. 

To verify axiom 4b P let P, Q, B be three collinear points, P 7* Q, 
P 7* R. Since J? lies on the line P + t(Q P) which contains P 
and Q, we can determine a in such a way that 

P + a(Q - P) = /?. 

This a will be ?* since R j& P. With this a we form the dilatation 
v(X) = aX + P - aP. 

Clearly <r(P) = P and a(Q) = R which shows that axiom 4b P 
holds for all P and also that any dilatation with fixed point P is of 
this form. We know, therefore, that (2.5) gives all dilatations of our 
geometry. 

Next we determine all trace-preserving homomorphisms of the 
translation group T. By Theorem 2.10 we may proceed as follows. 

We select as the point P the origin (0, 0). If v(X) = aX + C 
has (0, 0) as fixed point then (0, 0) = C. The dilatations with P 
as fixed point are of the form 

<rX = aXj a ^ 0. 
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The a is uniquely determined by <r. The maps r > ora" 1 are then 
all trace-preserving homomorphisms j* and different dilatations 
or will give different homomorphisms. We have <r~ l X = oT l X. For 
the translation r c we get as image 

<TT C <T~\X) = <TT C (a~ l X) = cKcT'X + C) 

= aCcT'X + C) = X + a<7 = r aC . 

We see that for a given a 7* the map r c > r a(7 is the desired 
trace-preserving homomorphism. To it we have to add the 0-map 
which maps each r c into 1 = T O . This also can be written in the form 
rc * r aC j namely for a = 0. 

Denote now the map r c r aC for a t k by a and let A; be the 
field of all trace-preserving homomorphisms. We have just seen that 
a > a is a one-to-one correspondence between the field k and the 
field k. We contend that it is an isomorphism and have to this effect 
merely to show that a + ft = a + J5 and that aft = aft. 

The map a sends r c into r aC , ft sends it into r ftc and a + ft (by 
definition of addition in k) into T aC 'T ftC 
But 

r aC (X + ftO = X + ftC + aC 



a + j8 sends, therefore r c into r (a+/5 )c and a + ft does the same. 

The map ft sends r c intone and aft sends it into the image of r 0c 
under a which is T a $ c . aft does the same. The fields fc and k are 
indeed isomorphic under the map a > a. 

Now we introduce coordinates based on the field k. We select our 
origin (0, 0) also as origin for the coordinates in k, select TI = r c , 
T 2 = T D where C = (1, 0) and D = (0, 1). Let (f, ?;) be a given point P. 
Then 

TI = TC = r f (7 and rj = TO = Tr, D ; 
r f (7T, D (0, 0) = ric(riD) = 17!) + ^ = (f, r;) = P. 

The translation rfrf will send the origin into P. The prescription 
was to assign the coordinates (, rj) to the point P. We see, therefore, 
that P = (, 77) receives merely the coordinates (|, ?/) if we base the 
coordinates on K. 
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This means that the coordinates are "the same" apart from our 
isomorphism and shows that on the whole we have recovered our 
original field k. Notice that the isomorphism k -> k is canonical. 

6. Desargues' theorem 

Assume now of our geometry only that the first three axioms hold. 
The following theorem may or may not hold in our geometry (see 
Figure 1). 

THEOREM 2.15. Let li , 1 2 , J 3 be distinct lines which are either 
parallel or meet in a point P. Let Q, Q' be points on li , R, R 1 points 
on 1 2 and S, S' points on 1 3 which are distinct from P if our lines meet. 
We assume 

Q + R\\Q' + R' and Q + S \\ Q' + S' 
and contend that then 

R + S || R' + S'. 

This theorem is called Desargues' theorem. 

o 



4 





z 




Figure 1 

REMARK. There are certain trivial cases in which this theorem 
is obviously true: 
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1) If Q = Q', then Q + R = Q' + R' and since 1 2 and Q + R 
have R in common but not Q it follows that R = R' and S = S'. 
Thus we may assume, if we wish, that the points are distinct. 

2) If Q, R, S are collinear, then Q', 72', S' are collinear and the 
statement is true again. 

If the lines Zi , 1 2 , 1 3 are parallel we shall call our theorem D a . 
If they meet in a point P, then we shall call it DP. 

THEOREM 2.16. Axiom 4a implies D a and axiom 4b P implies DP. 

Proof: Let a be the translation which sends Q into Q' (if we wish 
to prove D a ) or the dilatation with P as fixed point which sends Q 
into Q' (if we wish to prove DP). The lines li , 1 2 , J 3 will be among 
the traces of 0. Since 

Q + R || <rQ + <rR = Q' + <rR, 

we obtain <rR = R f and similarly <rS = S'. 
Since 

R + S || R + <rS = R' + S' 
our theorem is true. 

THEOREM 2.17. D a implies axiom 4a and DP axiom 4b P. 
Proof: 

1) Suppose each line contains only two points. Then there are two 
lines in a given pencil of parallel lines, hence only four points A, B, 
C, D in our geometry. There are six lines A + B, A + C, - - each one 
containing only the two points by which it is expressed. Therefore 
this geometry must have a unique structure. On the other hand there 
is a field k with only two elements 0, 1 and the rule 1 + 1=0. 
The geometry based on this field has only two points on a line and is 
therefore the four point geometry. This proves that all axioms hold 
in this four point geometry, especially axioms 4a and 4b. We may, 
therefore, assume that each line contains at least three points and it 
follows that to two lines one still can find a point not on these lines. 

2) Assume D a . With any pair Q, Q' of distinct points we are going 
to associate a map T Q>Q ' which will be defined only for points R 
not on the line Q + Q r . 

Let I || Q + Q' and / contain R; then I j* Q + Q'. 
Let m || Q + R and m contain Q'; then m 7* Q + R. 
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Since C + Q'>ffQ + #we have I jf m. Let R' be the point of inter- 
section of m and I (Figure 2). 

R' lies on I, hence not on Q + Q', it lies on m, hence not on Q + R. 
This shows R' t* Q' and R' 7* R. 





Q + Q 1 



S S 1 

Figure 2 

The construction may be described in simpler terms: R + R r 
|| Q + Q' and Q + R \\ Q' + R'. These statements describe the 
image R' of R and T Q ' Q '. 

With this pair R, R' we may now construct the map T R ' R '. It 
will obviously send Q into Q'. Let now S be a point not on R + R' 
or Q + Q' and S' the image of S under T Q>Q/ . We have: R + R' 
II Q + Q' II $ + S' and these three lines are distinct. Furthermore 
Q + R || Q' + #' and Q + S \\ Q' + S'. Since we assume D a we 
can imply R + S \\ R' + S'. But the statements R + R' \\ S + S' 
and R + S \\ R f + S' mean now that S' is also the image of S under 
T R ' R \ The two maps r Q ' 9 ' and r R ' R ' agree wherever both are defined. 
Since we assume that a point S outside Q + Q' and R + R' does 
exist we can also form r St8 ' and know of the three maps T Q ' Q ', 
T R ' Rf , T S ' 8 ' that they agree wherever two or all three of the maps 
are defined. The desired map r is now the combination of the three 
maps: for any point T the image shall be the image under one of the 
three maps (for whichever it is defined). This map r has the property 
that r(Q) = Q' as we have remarked already when we introduced 
T R ' Rf . If we can show that r is a dilatation, then it is clear that r 
will be a translation since all traces have turned out to be parallel 
and the proof will be complete (the case Q = Q' is trivial identity 
is the map). 

Let U, V be two distinct points. One of the lines Q + Q', R + R', 
S + S' will not contain U and V and we may assume that Q + Q 1 
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is this line so that we may use T Q ' Q ' for computing images. If 
U + V || Q + Q', then U + V contains also U' and V. We may, 
therefore, assume that U + V Jf Q + Q'. Then U and V are in the 
same position as our previous 72 and S for which we have proved 
that 72 + S || R' + S'. This finishes the proof. 

3) If we assume DP, we can prove 4b P by a method completely 
analoguous to the previous case using the lines through P instead 
of lines parallel to Q + Q'. We can leave the details to the reader. 

Because of Theorems 2.16 and 2.17 our geometry is frequently 
called Desarguian geometry, our plane a Desarguian plane. Desargues' 
theorem is the other geometric interpretation of the axioms 4a and 
4b which we have promised. A third geometric interpretation shall 
be discussed later. 

7. Pappus 9 theorem and the commutative law 

One of the simple but fascinating results in foundations of geometry 
is the fact that one can find a simple geometric configuration which 
is equivalent with the commutative law for multiplication of our 
field k. 

Select an arbitrary point P. An element a ^ of k can be obtained 
(Theorem 2.10) by one and only one dilatation o- with P as fixed 
point in the form 

T* = (T a rcr~ l . 

j _ r<r~ l then T** = (V) a = a (T Ta~ l a~ l = 00 r(a <r )~ l 
on one hand and r aft = o- a/5 ro-aj on the other. Since cr a is uniquely 
determined by a. we have o- a/s = <r n <r ft , and this means that the 

s^ 
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multiplicative group of non-zero elements of k is isomorphic to the 
group of dilatations which have P as fixed point, fc will be commutative 
if and only if this group of dilatations is commutative. 

Select now two distinct lines I and m through P and let Q be any 
point 7* P on I. If <TI is a dilatation which has P as fixed point, then 
/ is a o-j-trace, 0iQ = Q f will be 9* P and may be any point on I 
by axiom 4b P. We describe, therefore, <r l completely by 0$ = Q'. 
Similarly let us select points ft, R f on ra which are distinct from P 
and let us describe another dilatation <7 2 with P as fixed point by 
<? 2 R = R' (Figure 3). 

We shall first construct the two points S = a^Jl on m and 
T = <r 2 <TiQ on Z. They are given completely by the descriptions: 



Q + R' || *iQ + *Jt' = Q' + wJl = Q f + S, 
R + Q'\\ <r*R + *2Q' = R' 



To have (T 1 a 2 = o^! it is necessary and sufficient to have 
<T 2 <TiQ or <7i(r 2 Q = T. Since <ri<r 2 Q lies on I it is determined by 

Q + R || <r lC r 2 Q + (7^2/e = cr^Q + S 

and we see that Q + # || T + S is the condition for commutativity. 
We can now forget <TI and a 2 and look at the configuration. T on I 
and S on m are determined by 

Q + R'\\Q'+S and Q' + R \\ R' + T 

and now we must have Q + R \\ T + S. Six lines Q + R', R' + T, 
T + S, S + Q', Q' + 72, 72 + Q occur in this configuration which 
may be viewed as a hexagon "inscribed" in the pair Z, m of lines 
with our six points as "vertices". Q + R' and Q' + S, Q' + R and 
R' + Tj R + Q and T + S are pairs of opposite sides of the hexagon. 
The configuration states that if two of these pairs are parallel, so 
is the third. 

This configuration is known as Pappus' theorem and we now 
may state 

THEOREM 2.18. The field k is commutative if and only if Pappus 9 
theorem holds. 

EXERCISE. Show that Pappus' theorem holds if the two lines 
I and m are parallel and that for this case the commutativity of k 
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is not needed. Do not use a computation with coordinates. Make 
use of translations. 

If a geometry contains only a finite number of points, then the 
group T is clearly finite and so will be the field k. If the reader has 
familiarized himself with Theorem 1.14 of Chapter I, 8 he will 
see the following geometric application. 

THEOREM 2.19. In a Desarguian plane with only a finite number 
of points the theorem of Pappus holds. 

No purely geometric proof is known for Theorem 2.19. 

8. Ordered geometry 

In some geometries, for instance in the case of the euclidean 
plane, the points of each line appear ordered. This ordering can, 
however, not be distinguished from the reversed ordering. An 
invariant way to describe such a situation is by means of a ternary 
relation: the point P lies "between" Q and R. Hilbert has axioma- 
tized this ternary relation and the reader may consult his book 
"Grundlagen der Geometric". We shall just follow the more naive 
way, considering a linear ordering and the reversed ordering as 
equivalent. We presuppose a knowledge of Chapter I, 9, at least 
Theorem 1.16. 

The ordering must, however, be related to the geometry of the 
plane. The following definition describes this relation. 

DEFINITION 2.8. A plane is said to be ordered if 

1) the set of points on each line is linearly ordered; 

2) parallel projection of the points of one line onto the points of 
another line either preserves or reverses the ordering. 

The main result will be the following theorem (see Chapter I, 
9, Definition 1.15). 

THEOREM 2.20. An ordering of a plane geometry induces canonically 
a weak ordering of the field k and a weak ordering of k induces canonically 
an ordering of the geometry. 

Proof: 

1) Assume the geometry is ordered. We order the field k by the 
following procedure: Select a point P and a translation T ** 1; 
then, as a ranges over fc, the point r a (P) will range over all points 
of the r-trace / of P. We have, therefore, a one-to-one correspondence 
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between the elements a of fc and the points r a (P) of I. The ordering 
of k shall be the one induced by the ordering of I. 

We must first of all show that this ordering does not depend on 
the selected P and r. 

Let us replace r by a translation TI 5^ 1 whose direction is different 
from that of r and let li be the retrace of P. We use again Theo- 
rem 2.10 stating that we can find for each non-zero element a t k 
a unique dilatation a a which has P as fixed point and which satisfies 
TO = <r a T <ra l for all translations T O . Hence To(P) = <r a T Q (r^ l (P) = 
<7T (P). We have, therefore, 

r-(P) = <r a r(P) and r^P) = cr^P). 

r(P) and r a (P) are different from P and lie on J, n(P) and r"(P) 
are also different from P and lie on li . We have 

r(P) + r,(P) || (r a r(P) + ^(P) = r a (P) + r<i(P). 

This means that the lines r a (P) + T(P) are all parallel that 
T*(P) is obtained from r a (P) by parallel projection. This is, of 
course, also true if a = since this gives the point P which is the 
intersection of I and h . By our assumption the ordering of k induced 
by P and r is either the same or the reverse of the one induced by 
P and TI . 

If r and T { have the same direction, replace P, r first by some 
P, T 2 where r 2 has a different direction and then P, T 2 by P, TI. 
This shows the independence from T. 

Let us now replace P by a point Q and assume first that Q does 
not lie on I. Call li the T-trace of Q; the lines / and Z, are parallel but 
distinct. We have 

P + Q || T*(P) + T a (Q) 

which shows again that the points T a (Q) are obtained from the points 
T a (P) by parallel projection. The induced ordering is the same. 

If Q lies on I, replace P first by a point Qi not on I and then Q! 
by Q. Our ordering of k does not depend on P and T in the weak 
sense that it may possibly be reversed. 

Now we must show that k is weakly ordered: Let 8 be a fixed 
element of k and order k first by P and T, then by T*(P) and T. The 
two orderings will either be the same or reversed. In the first case 
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the a will be ordered as the points r a (P), in the second case as the 
points T a (r\P)) = r a+ '(P). This shows that the map x - x + 8 
(x e /c) either preserves or reverses the ordering. Order also, if d 5* 0, 
k first by P and T and then by P and r*. In the first case, the a are 
ordered as the points r a (P), in the second case as the points (/) "(P) 
= r a *(P). This shows that the map x * xd either preserves or 
reverses the ordering, k is indeed weakly ordered. 

2) Assume now that k is weakly ordered. Let I be a given line. 
Select a point P on I and a translation r ^ \ which has Z as trace. 
r a (P) will range over the points of I if a ranges over k. Order the 
points on I as the corresponding a are ordered. We have to show 
again that the ordering does not depend on the choices of P and T. 

Another point Q of I has the form Q = /(P) for some fixed d t k. 
Then r a (Q) = r a+5 (P). The ordering of the a + d is the same or the 
reverse of the ordering of the a and this shows the independence on 
the choice of Q. Let T V be another translation 7* 1 with the same 
trace I Then n = r 5 with some d 7* 0. We find r(P) = (r') a (P) = 
r a5 (P). The a and the ad are ordered either in the same or in the 
reversed way. This shows the independence of our prescription. 

Finally we must show that parallel projection either preserves 
or reverses the ordering of the points on lines. Let I and m be distinct 
lines and suppose we are given a parallel projection mapping the 
points of I onto the points of m. We have to distinguish two cases: 

a) I and m are parallel. Then we may select the same r for both 
lines. Select P on I but select Q on m to be the image of P under the 
given parallel projection. The points of I are ordered according to 
r a (P), those of m according to r a (Q). But 

P + Q II r"(P) + r-(0) 

which means that r a (Q) is the image of r a (P) under our parallel 
projection. 

b) I and m intersect in P. Let r be any translation 7* 1 with trace 
I and Q the image of P under T. Q will be a point ^ P on the line I. 
Under our parallel projection it will have an image Q' on m which 
is 9* P. There is a translation TI which carries P into Q'. Its trace 
through P will be m. Order the points of I by r a (P), the points of 
m by r"(P). As in 1) we have 



r"(P) = er a r(P) = <r.(Q) and <r\(P) = ir.r.CP) = <r a (Q') 
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if a T* 0. Then 

Q + Q'\\ *a(Q) + *.(Q') = T(P) 

which shows that r"(P) is the image of T a (P) by our parallel pro- 
jection; this is trivially true if a = 0. We see that the ordering of 
m corresponds to the ordering of I under our parallel projection. 

3) Notice that the way to go from a weakly ordered field to an 
ordered geometry and the reverse procedure are consistent with 
each other. 

The geometry with only four points is a freak case. Each line 
contains only two points, it is ordered trivially since only one ordering 
and its reverse are possible. This geometry corresponds to the field 
with two elements which is a freak case of a weakly ordered field. 

Chapter I, 9, Theorem 1.16 implies now 

THEOREM 2.21. Aside from the plane with only 4 points and the 
field with only two elements the following is true: Any ordering of a 
Desarguian plane induces canonically an ordering of its field k and 
conversely each ordering of k induces an ordering of the plane. 

We shall leave aside the freak case. 

DEFINITION 2.9. An ordering of a Desarguian plane is called 
archimedean if it has the following property: 

If TI 5^ 1, r 2 7* 1 are translations with the same direction and if 
a point P does not lie between r^P) and r 2 (P), then there exists an 
integer n > such that r 2 (P) lies between P and r?(P). 

Let us translate what this means for the field. We can write 
r 2 = r" . The three points r?(P) = P, r^P) and r 2 (P) = r"(P) are 
ordered as are 0, 1, a. The assumption is that does not lie between 
1 and a so that a > 0. We wish to find an integer n such that r"(P) 
lies between r?(P) and r?(P). This means < a < n. By definition 
1.16 of Chapter I, 9 this means that fc is archimedean. Using Theorem 
1.18 of Chapter I, 9 we see 

THEOREM 2.22. The necessary and sufficient condition for an 
ordered geometry to come from a field k which is isomorphic to a subfield 
of the field of real numbers (in its natural ordering) is the archimedean 
postulate. Notice, therefore, that in an archimedean plane the theorem 
of Pappus holds the field is necessarily commutative. 

Chapter I, 9, Theorem 1.17 shows that one can not in general 
prove Pappus' theorem in an ordered geometry. 
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THEOREM 2.23. There do exist ordered Desarguian planes in which 
the theorem of Pappus does not hold. These geometries are, of course, 
by necessity non-archimedean. 

EXERCISE. A consequence of the ordering axioms is the fact 
that a map x ax (a 7* 0) either preserves or reverses the ordering. 
Give a geometric interpretation. 

9. Harmonic points 

Let II be a Desarguian plane. We shall describe a certain con- 
figuration (Figure 4) : 




A c B D 

Figure 4 

To three given distinct points A, B, C on a line I select three parallel 
lines li , 1 2 , I* which go respectively through A, B, C and are not 
parallel to I. Select on 1 3 a point P ~ C. The arbitrary elements are, 
therefore, the pencil TT of parallel lines containing ^ , 1 2 , 1 3 but not I 
and the point P 9* C on 1 B . 

The line P + A is not || 1 2 ; otherwise it would be the line Z 3 and 
A does not lie on 1 3 . The lines P + A and 1 2 intersect in a point R 
which is not on I', otherwise it would have to be the point B, and B 
does not lie on P + A since P is not on I. Similarly P + B will 
meet the line Z t in a point S which is not on /. The lines / and R + S 
are certainly distinct. If they are not parallel, they will meet in a 
point D; if they are parallel, we shall indicate this fact by putting 
D = oo. 

The contention is that D will not depend on the selected pencil IT 
and on the point P. We shall prove this by computing the coordinates 
of D since we shall need this computation for a later purpose. 
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The origin of our coordinate system shall be a given point on 
/ and TI a given translation 7* 1 with I as trace. This alone determines 
already the coordinates of any point X on I : X = ( , 0) where is 
determined by r\(O) = X. We have, therefore, the freedom to 
select T 2 as we please (without changing the coordinates of I) pro- 
vided T 2 9* 1, has not the trace I. We select r a as the translation which 
carries the point C into the point P. 

Let (a, 0), (6, 0), (c, 0) be the coordinates of A, B, C. The map r\ 
carries into (7 and r^r\ carries into P. The point P has, therefore, 
the coordinates (c, 1). The elements a, 6, c are distinct, li , 1 2 , Is 
are traces of r 2 . Any point on ^ will have coordinates (a, 77) since 
r5r!(0) = rJ(A); a point on Z 2 will have coordinates (6, 77'). The 
equation of the line P + A is 

y = (x - a)(c - a)' 1 

since it is satisfied by (c, 1) and (a, 0). R = (&, 17) lies on this line and 
we find 

r? = (b a)(c a)" 1 . 

Thus we have 72 = (6, (6 a)(c a)" 1 ) and we find, by symmetry, 
S = (a, (a - 6)(c - fr)" 1 ). 

Let T/ = xm + n be the equation for R + S (it is not of the form 
x = e since R + S is not || ii). The points JK and S lie on it and we 
infer: 

/ 2 6 ) ( & - a )( c - )~ 1 = bm + n > 

(a b)(c b)~ l = am + n. 

If we subtract, the factor b a appears on both sides and we obtain 
(2.7) m = (c - a)' 1 + (c - 6)- 1 . 

We stop for a moment and ask whether m can be 0. This would 
imply 

(c - a) = -(c - 6) 

or 2c = a + b. If the characteristic of k is 5^ 2 this happens if and 
only if c = %(a + 6), if C is the "midpoint" of A and B. If the charac- 
teristic is two, then = a + &ora = 6 which is against our assump- 
tion; therefore m = if and only if the characteristic of k is j& 2 
and if c = (a + 6). For m = the equation of R + S is y = n; 
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it means that I and R + S are parallel, hence D = . Thus the case 
where D = oo does not depend on TT and P. 

Suppose that m 7* 0, D = (d, 0). Since D lies on y = xm + n 
we obtain 

d = nm" 1 . 
If we multiply (2.6) by m"" 1 we get 

(2.8) (a - 6)(c - br l m~ l = a - d. 
Let us first compute the inverse of the left side: 
m(c - 6)(a - 6)" 1 

= ((c - a)' 1 + (e - &)-% - b)(a - &)- 
= (c - a)-'(c - 6)(a - 6)' 1 + (a - b)~ l 
= (c - a)-'((c - a) + (a - 6))(a - 6)" 1 + (a - ft)' 1 
= (a - 6)-' + (c - a)' 1 + (a - 6)- 1 
= 2(a - b)" 1 + (c - a)" 1 . 
Hence, from (2.8) : 

a - d = (2(a - 6)" 1 + (c - a)" 1 )' 1 
and finally 

(2.9) d = a - (2(a - 6)" 1 + (c - a)" 1 )" 1 . 

This result proves, of course, more than we have contended; notice 
that the expression for d uses only addition, subtraction and inverting. 
Suppose that the points of our geometry are the ordered pairs 
(f , 77) from a given field k. We have written the equation of a straight 
line in the form xa + yb + c = with the coefficients on the right. 
The reason for this can be retraced; it originates from the fact that 
all our mappings were written on the left of the object we wanted 
to map. Suppose that we had done it the other way around. Then 
we would also have obtained a field k connected with the geometry. 
This field would now not be the the same as the field fc, it would 
be an anti-isomorphic replica of fc, the multiplication reversed. If, 
on the other hand, we start with a given field and call the ordered 
pairs points, we may introduce two geometries which in general 
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will not be the same. In one of them lines are given by equations 
xa + yb + c = (let us call them "right lines"), in the other by 
equations ax + 6/y + c = 0. Only a few lines will coincide for instance 
the o>axis: y = 0, most of the others will not (unless the field is 
commutative). 

If we are given the three points (a, 0), (6, 0), (c, 0) on the x-axis 
we can construct D in the "left" geometry or in the "right" geometry 
over our fixed field k. It is pretty obvious that we would end up 
with the same value (2.9) for d since multiplication of two elements 
is not involved in the formula. 

We shall call A, 5, C, D harmonic points and D the fourth har- 
monic point to A, B, C. Then D is the same in the left and in the 
right geometry. This is the first reason why we have written d in 
the apparently clumsy form (2.9). 

Is D distinct from A, B, C? D = A would mean that R + S 
contains A. Since S 9* A, R + S = li which is false. By symmetry 
D ^ B. How about D = C? 

If the characteristic of k is 2, then (2.9) shows 

d = a ((c a)" 1 )" 1 = a (c a) = c 

and we have indeed D = C. 

Let k have a characteristic 5^ 2. If we had d = c, then we would 
get from (2.9) 

c- a = -(2(a- 6)- 1 + (c - a)' 1 )' 1 , 

(c - a)' 1 = -2(a - 6)- 1 - (c - a)' 1 , 

2(c - a)' 1 = -2(a - ft)' 1 , 

c a = a +6, c = 6, 
which is false. 

THEOREM 2.24. // the characteristic of k is 2, then the fourth 
harmonic point D coincides with C and this configuration characterizes 
the fact that k has characteristic 2. // k does not have characteristic 2 
(for instance in an ordered geometry with more than four points) the 
four harmonic points A, B, C, D will be distinct. 

Assume now that the characteristic of A; is 5^ 2. 

The construction itself exhibits a symmetry in A and B. D does 
not change if A and B are interchanged. But there are more sym- 
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metries (2.9) can be changed into an equivalent form by trans- 
posing the term a and taking an inverse: 

(d - a)' 1 = -2(a - b)' 1 - (c - a)~ l 
which is equivalent with 

(2.10) (d - a)' 1 + (a - b)- 1 = (a - c)' 1 + (b - a)' 1 . 

Interchanging C, D the condition remains the same. 
Consider now 

x~ l + y~ l = x~\x + y)y~ l = y'\x + y)x~\ 

Using this identity on both sides of (2.10) in two different ways 
and taking care that the factors (b a)" 1 and (a 6)" 1 = (fe a)" 1 
are factored out on both sides of the equation either to the left or to 
the right, we obtain after a cancellation two more equivalent forms: 

(2.11) (d - 6)(d - a)" 1 = -(b - c)(a - c)' 1 , 

(2.12) (d - a)~ l (d - 6) = -(a - cT*(b - c). 

If we interchange A and C and at the same time B and D in (2.11) 
we obtain 

(6 - d)(b - c)" 1 = -(d - a)(c - a)" 1 ; 
just shifting terms it becomes 

(d - d)~\b - d) = -(c - a)-\b - c) 

which differs only in sign from (2.12) and is, therefore, correct. 

We see that one should talk of two harmonic pairs A, B and C, D 
since we have established all the symmetries. We have, of course, 
assumed D 7* oo 9 a restriction, which will be easily removed in the 
next paragraphs. 

If our geometry is ordered, the two harmonic pairs A, B and C, D 
separate each other. Assume this were not the case. Placing the 
coordinate axis appropriately and making use of the symmetries 
we could assume that a < b < c < d. But then the left side of (2.11) 
would be positive and the right side would be negative which is a 
contradiction. 

Assume still that the characteristic of k is ^ 2. The notion of 
two harmonic pairs has introduced some structure in as poor a 
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geometry as that of a one dimensional line of a plane. It is now 
very natural to ask how "rigid" this structure is, in other words, 
what symmetries it has. 

Let a be a one-to-one map of the line I onto itself which carries 
harmonic points (finite points) into harmonic points. First of all 
o-" 1 will have the same property. To show this, let A', B', C', D' be 
harmonic points and A, J5, C, D their images under cr" 1 . If the fourth 
harmonic point D! of A, B, C is finite, then A', B', C", <r(Z>i) are 
harmonic and, therefore, <rD 1 = D 1 = <rD, D l = D. We are in doubt 
if c = (a + 6)/2; but if it were, then d j* (a + 6)/2 and we could 
conclude that the fourth harmonic point of A, B, D must be C. It 
follows now that three points whose fourth harmonic is at < must 
have images with the same property, or else we would get a contra- 
diction using o-" 1 . This shows that 



whenever a 5^ 6 (considering the triplet a, b, (a + 6)/2). If a = 6 
the equality is trivial. 

Make at first the special assumption that <r(0) = and o-(l) = 1. 
For 6 = 0, (2.13) becomes 



2 

This can be used on (2.13): 

(2.14) <r(a + 6) = *(a) + (K&) 

which means that a is an isomorphism under addition. Since <r(l) = 1 
we get a( 1) = 1. 

Let a = 1, 6= +1 and c 7* 0, 1, 1. From (2.9) we obtain 
for the fourth harmonic point 

Since -1 + (c + I)" 1 = (-(c + 1) + l)(c + I)- 1 = -c(c + I)' 1 
we have 

d = -1 + (c + l)<f * = -1 + 1 + c' 1 = c" 1 . 

In other words 1, 1, c, c"" 1 are harmonic. Their images 1, 1, 
<r(c), o^c"" 1 ) are again harmonic showing 
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(2.15) a(f') - We))' 1 , 

an equation which is trivially true for c = 1 so that only c j* 
is needed. Conversely, if a map satisfies (2.14) and (2.15), then 
one glance at formula (2.9) tells us that if d is the fourth harmonic 
point of a, 6, c, then a(d) will be the fourth harmonic point of o-(a), 
<r(6), <r(c), since only sums, differences and inverses arc involved in 
(2.9). Conditions (2.14) (2.15) together with <r(l) = I characterize 
such maps. 

We turn now to Chapter I, 8, Theorem 1.15 and see that a is 
either an automorphism or an antiautomorphism of k. 

To settle the general case where <r(0) = or o-(l) = 1 may be 
violated, notice that <r(x) <r(0) differs from a(x) by a translation 
and (cr(l) - <r(0)r l (<r(x) - cr(0)) = T(X) differs from <r(x) - <r(0) 
by a dilatation. Dilatations preserve the configuration of harmonic 
points. But r(0) = 0, r(l) = 1 so that r(x) = x r is either an auto- 
morphism or an antiautomorphism of k. Computing a(x) we find 
<r(x) = (o-(l) a(0))z T + or(0); it is, therefore, of the form 
<r(x) = ax r + b with a 7* 0. 

THEOREM 2.25. The one-to-one onto maps & of a line onto itself 
which preserve harmonic (finite) pairs are of the form 

<r(x) = ax r + 6 

where a ^ and where r is either an automorphism or an antiauto- 
morphism of k. 

REMARK. If identity is the only automorphism of k (k must be 
commutative in such a case, since otherwise there would be inner 
automorphisms), for instance, if k is the field of real numbers, then 
a(x) = ax + b. This result was first obtained by von Staudt. 

10. The fundamental theorem of projective geometry 

If V is a left vector space over a field k and V a left vector space 
over a field k f and if there exists an isomorphism /* which maps k 
onto A;', then we can generalize the notion of a homomorphism in an 
obvious fashion. 

DEFINITION 2.10. A map X : V > V is called semi-linear with 
respect to the isomorphism M if 
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1) \(X + Y) = \(X) + X(F), 2) \(aX) = <A(X) for all X, Y t V 
and all a e k. 

The description of all semi-linear maps follows the same pattern 
as that of homomorphisms. If A l , A 2 , , A n is a basis of F, if 
B { , B 2 , , # are arbitrary vectors in V (thought of as images of 
the basis vectors A t ) and if -Y = 2* x \^\ i s an arbitrary vector of 
Fthen 

X(X) = Z o^ 
t-i 

is a semi-linear map of V into V and every semi-linear map is of 
this form. It is one-to-one if and only if the B t are independent, 
and one-to-one and onto V if and only if the B t are a basis of V. 
We can leave these rather trivial details to the reader. 

With each left vector space V over k we form a new object, the 
corresponding projective space V. Its elements are no longer the 
single vectors of F, they are the subspaces U of F. To each subspace 
U of F we assign a projective dimension: dim,, U = dim 171, 
just by 1 smaller than the ordinary dimension. 

We take over the terminology "points' ' and "lines" for subspaces 
with projective dimension 0, respectively 1. Thus the lines of F 
become the "points" of F and the planes of F become the "lines" of 
F. The whole space F of ordinary dimension n gets (as element of F) 
the projective dimension n 1. The 0-subspace of F should be 
thought of as the "empty element" of F and has projective dimension 
1. An incidence relation is introduced in F; it shall merely be the 
inclusion relation U l C U 2 between the subspaces of F. Inter- 
section Ui C\ U 2 and sum U l + U 2 can then be explained in terms of 
the incidence relation of F: Ui C\ U 2 is the "largest" element con- 
tained in Ui and U 2 , U l + U 2 is the "smallest" element which 
contains both U l and U 2 . 

DEFINITION 2.11. A map <r : V V of the elements^ of a pro- 
jective space F onto the elements of a projective spaceF' is called 
a collineation if 1) dim F = dim F', 2) a is one-to-one and onto, 
3) U l C U 2 implies <r[7, C *U 2 . 

As an example of a collineation suppose that there exists a semi- 
linear map X : F > V which is one-to-one and onto._If we define 
<rU = Xf7, then o- is obviously a collineation of V > V and we say 
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that a is induced by X. The main purpose of this paragraph is to 
prove that every collineation is induced by some semi-linear map 
if dim F > 3._ 

Let <r : V > V ' be a collineation, U a subspace of V and dim V = n. 
We can find a sequence = U C U\ C U 2 C C U n = V of 
subspaces, where dim U f = j, such that U is one of the terms. 
By assumption: <rU Q C vU\ C C <rU n . Since C7 t - 5^ C7,- +1 and 
since <r was one-to-one it follows that o-Ui ^ <rU i+ i . The dimen- 
sions of the <rUi increase, therefore, strictly with i. But dim V = n 
puts an upper bound n on dim at/, . This implies dim at/, = i and 
consequently dim C7 = dim erf/. A collineation preserves the dimen- 
sion. 

Let [/i and U 2 be subspaces of V and suppose that <rll l C_ aU 2 . 
We can then find a supplementary space to crU l in crU 2 ; it must be 
the image of some W, in other words aU 2 = <rUi + orW with 
crU l C\ aW = 0. Since U C\ W is a subspace of C/i and of W we 
conclude that c^CA C\W) C <rt/i H o-TF = 0. This shows U^ H W = 0. 
Therefore 

dim(U 1 + W) = dim f/j + dim W = dim o-C/, + dim aW 

= dimCaC/! + <rW) = dim(<7C7 2 ), 
hen co 



On the other hand f/i as well as W are subspaces of C/! + W and 
consequently rCA + aW C <r(l/i + TF) or cri7 2 C ^(C/! + IF). Since 
the dimensions are equal we get crU 2 = <r(Ui + W) and since <r is 
one-to-one we have U 2 = f/i + IF; therefore finally Ui C. U 2 . 
All this was needed to show that a" 1 is also a collineation. Since <r 
as well as o-" 1 preserves the inclusion relation and since intersection 
and sum can be explained in terms of the inclusion relation we obtain 
also that 

<r(f/i H U 2 ) = rUi H *U 2 and a(Ui + U 2 ) = at/! + (7f/ 2 . 

Suppose that we know of <r only the effect on the "points" of F 
(they are the lines of F). If U is any subspace of F and 
U = L! + L 2 + + r where each L t - is a line, then aU = 
O-L! + <rL 2 + - - + <rL r and we see that or is completely known if 
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its effect on the lines of V is known. Its effect on lines will have 
the following property: Whenever L l C L 2 + L 3 then <rL l C 
(7/> 2 + (T//3 . We can now formulate and prove the fundamental 
theorem of projective geometry: 

THEOREM 2.26. (Fundamental theorem of projective geometry). 
Let V and V 1 be left vectorjspaces of equal dimension n > 3 over fields 
k respectively fc', V and V the corresponding projective spaces. Let a 
be a one-to-one (onto) correspondence of the "points" of V and the 
"points" of V which has the following property: Whenever three distinct 
"points" LH , L 2 , L 3 (they are lines of V) are collinear \ L l C L 2 + L 3 , 
then their images are collinear: ^L l C <rL 2 + <rL 3 . Such a map can 
of course be extended in at most one way to a collineation but we contend 
more. There exists an isomorphism /* of k onto k r and a semi-linear 
map X of V onto V (with respect to p) such that the collineation which 
X induces on V agrees with on the points of V. If Xi is another semi- 
linear map with respect to an isomorphism MI of k onto k 1 which also 
induces this collineation^ then \\(X) = \(&X) for some fixed a 7* 
of k and the isomorphism PI is given by x^ = (axa' 1 )". For any a 7* 
the map \(aX) will be semi-linear and induce the same collineation as X. 
The isomorphism n is t therefore, determined by <r up to inner auto- 
morphisms of k. 

REMARK 1. The assumption that LI , L 2 , L 3 are distinct is irrele- 
vant since LL C. L 2 + L 3 will imply trivially that vL^ C <rL 2 + <rL 3 
as soon as two of the "points" are equal. 

REMARK 2. If n = 2, then dim p V = 1. There is only one line, 
and a completely random one-to-one correspondence of the points of 
V and the points of V would be a collineation. The statement would 
be false for any field with at least 5 elements; only for F 2 , F 3 , F 4 
would it still hold. 

Proof: 

1) We show by induction on r: if L C LI + L 2 + + L r , 
then <rL C o^i + <rL 2 + + 0L r . This is trivial if L = L r . If 
L 7* L r , then L is spanned by a vector A + B where A e L l + L 2 + 
+ L r _ t (A 7* 0) arid B e L r . Then <r(A) C erLi + o-L 2 + 

+ o-L r _i by induction hypothesis and from L C (A) + L r we get 
o"L ^ <7\A) ~p &L r . 

2) We shall use frequently the following reasoning. Let C and D 
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be independent vectors and L C (C) + (D) but L ^ (D). Then L 
is spanned by a vector a(7 + bD, with a ^ 0, which may be replaced 
by C + a~ l bD = C + dD. The element d is then uniquely deter- 
mined by L. 

3) Let A t be a basis of 7, L< = (A,-), o-L, = (A<>. We have V = 
L t + L 2 + - + L n ; if L is any line of V then L C 7 and conse- 
quently o-L C <riii + <rL 2 + + o-L n . Since our map is onto, 
any line of V will have the form o-L which shows <rL l + o-L 2 + 

+ <rL n = ^' so that the AJ span V. But dim F' = n implies 
that the A( are a basis of V. The line (A l + A.) will be C (A,) + (A,-) 
and distinct from (A t -) if i > 2; hence tr(A l + A t -} = (A( + 6,-A<); 
since (A t + A,-} ?* (A x ) we will have 6, 5^ 0. Let us replace At by 
the equivalent vector 6 t A< . Then <r(A t ) = (A() for i > 1 and 
iKAt + A,) = <A{ + AJ) for i > 2. 

4) Let x e t; (A, + xA 2 ) C (A,) + <A 2 ) and (A l + xA 2 ) ^ (A 2 ). 
There is a uniquely determined x 1 e fc' such that ^(Ai + xA' 2 ) = 
(A( + x'Ag) and, since (A! + xA 2 ) j (A! + yA 2 ) if x 9* y, we have 
x' 7* y'. The map fc > fc' given by x #' is, therefore, one-to-one 
but only into, thus far. Clearly 0' = and 1' = 1 by 3). 

In a similar way we could find a map x > x" such that 
C^A! + xA 3 ) = (Al + x"Ai). We contend now that x' = x" for 
all x e fc. We can assume x 9* 0. 

The line (a:A 2 #A 3 ) lies in (A 2 ) + (A 3 ) on one hand and in 
(A! + xA 2 ) + (A! + #A 3 ) on the other. Its image is, therefore, 
spanned by a vector of (A 2 ) + (A 3) and also by a vector of 
(A( + x'A 2 ) + (A( + x"A$. The only possibility for the image is 
the line (z'A 2 a/'Ag). But (xA 2 zA 3 ) = (A 2 A 3 ) whose image 
is by the same reasoning (l'A 2 l"Aa) = (A 2 A 3 ). This shows 
(x r A 2 - x"A'*} = (A 2 - Ai) and x" = x' follows. 

Instead of A 3 we could have taken any vector A, with i > 3. 
We conclude that there is but one map x > x' such that, for i > 2, 

ff (A l + xA<) = (Al + x'Aft. 

5) Suppose a(A l + o; 2 A 2 + - + Xr^Ar^) = (Al + x 2 Ai + 

+ x'-iA'-i) is proved. The line (A! + z 2 A 2 + + x r A r ) 
lies in (Ai + 2 A 2 + + ^r-iA,.-.!) + (A r ) and is distinct from (A r ). 
Its image is, therefore, spanned by a vector of the form Al + x 2 A 2 + 

+ x'r-iA' r -i + uA' r . Our line is also in (A! + x r A r ) + (A 2 ) + 

+ (A r _i) which implies that the image is in (A I + x' r A' r ) + 
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(A 2 ) H; + (A'_i). Since the image must use A{ we get now u = x' r . 
We know, therefore, 

a(A, + x 2 A 2 + - + x n A n ) = <A( + Zs'Ai + - + xiA$. 

6) The image of the line (x 2 A 2 + + x n A n ) is in (A 2 ) + 
+ (A' n ). It also lies in (A v + x 2 A 2 + - + x n A n ) + (A,), its image, 
therefore, in (A{ + x f 2 A 2 + + zA> + (AJ>. Clearly <r(x 2 A 2 + 



7) A given line of F' which has the form (A ( + yA 2 ) is not the image 
of a line in (A 2 ) + + (A n ) as 6) has shown. Hence it must be 
the image of a line (Ai + x 2 A 2 + + x n A n ) y which implies x 2 = y 
and shows that the map k * fc' is onto. 

8) (KA, + (x + y)A 2 + A 3 > = (A[ + (x + y)'AJ + A 3 '). 
But 

(A, + (x + y)A 2 + A 3 ) C (A, + xA 2 ) + <yA, + A 3 ) 
so that 

(A( + (x + y)'A' 2 + Ai) C (A( + x'A' 2 ) + (y f A 2 + AJ). 
We deduce easily (x + yY = x 7 + 2/'. 

9) ff (A l + xyA 2 + xA 3 ) = (A( + (xyYA 2 + x'AJ) 
and 

(A 1 + xyA 2 + xA 3 } C (A,) + (yA 2 + A 3 > 
hence 

(A{ + (xyYA 2 + x'AS) C (A[) + <y'AJ + Ai) 

which implies (xy) 1 = x'y'. Our map x > x' is an isomorphism 
/z of fc onto A;'. 

The line (x^A^ + + z n A n ) has (x?A( + + zA) as image; 
this is clear if ^ = or Xi = 1. If x v 9^0 our line is also given by 
(A! + Xi l x 2 A 2 + + x^XnAn) and its image (A( + x^xZAi + 
+ x^XnAn) is the same line as (x^A( + + nAn)- 

10) Let \ be the semi-linear map of F onto V (with respect to 
the isomorphism /x) which sends A t onto AJ . For any line L of F 
we have shown o-L = XL. This proves the first part of the theorem. 
The rest is very easy. 

11) Suppose \i is another semi-linear map of F onto F' which 
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has the same effect on the lines of V. Then X'% is a map of V onto 
V which keeps every line of V fixed. It will map every non-zero 
vector X of V onto some non-zero multiple of X. Let X and Y be 
independent vectors of V. The three vectors X, F, X + Y will go 
respectively into aX, 0F, y(X + Y). But X^X^X + Y) = \' l \^(X) 
+ X" 1 Xi(F) = aX + 0Y. A comparison with yX + yY shows 
a = 0. If -XT, F are dependent but ?^ and Z a third independent 
vector, then both X and F take on the same factor, under X~% , 
as Z. It follows now that \~ l \i(X) = aX with the same a for all 
X (the case that X = is trivial) and consequently X^-Y) = \(aX). 

12) Let a 5^ and define the map X t by \i(X) = X(ofX). For any 
ft e fc we have X^X) = X(oflX) = X^a^-aX) = (aflO"^^). 
The map \i is, therefore, semi-linear and its isomorphism ^ is given 
by x ^ = (axa~*Y- K A 7*0, then X t (A) = X(A) = M X(A) which 
shows that X^A) = X(A) so that \i and X induce the same collineation. 
The proof of our theorem is complete. 

For any semi-linear map X, let us denote its isomorphism by X. 
Suppose now V ^> V ^> V" is a succession of semi-linear maps. Then 



We see that X 2 X! is semi-linear again and that X 2 Xi = X 2 X t . 

Very interesting results are obtained in the case of collineations 
of a protective space V onto itself. The semi-linear maps of V onto 
itself form a group 8; the collineations of V form also a group which 
shall be denoted by PS. Each element X of S induces a collineation 
of V onto itself. The map S -A PS which associates with each X e S 
the collineation j(\) induced by it (j(\) is merely the effect X has on 
the projective space V) is clearly a homomorphism of S onto PS; 
it is an onto map by the fundamental theorem of projective geometry. 
The kernel of j consists of those X e S which keep every line of V 
fixed and we have seen that there is one such map X for each a 7^ 
of k, hence for each a 1 7c*, namely the map \ a (X) = aX. We may 
then map fc* into S by a -^ \ a ; since \ a \ ft = \ aft , this map i is an 
isomorphism of k* into S and the image of i will be the kernel of j. 
It is customary to express such a situation by a diagram 

1 -> fc* A S -4 PiS -> 1 

where two entirely trivial maps are added: every element of PS is 
mapped onto 1, and 1 > ft* maps 1 onto the unit element of A;*. 
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In this succession the image of each map is the kernel of the next 
map; if this situation occurs one calls the sequence of maps an 
exact sequence. Thus the statement that our sequence is exact 
contains all the previous ones, i.e., that i is an isomorphism, that 
its image is the kernel of j and that j is onto. 

Denote now by A (k) the automorphisms of k. We can map 
S -A A(k), where p denotes the map X > X which associates with 
each semi-linear map X the automorphism X. The map p is a homo- 
morphism as we have seen and it is an onto map since every auto- 
morphism is possible in some semi-linear map. 

The kernel GL (general linear group) of the map p consists of 
those elements of S for which the automorphism is identity. They 
are merely our old homomorphisms of V onto V. We have, therefore, 
another exact sequence 

1 -> GL ^ S -A A(k) -> 1 

where inj indicates the mere injection map of GL into S. 

Each element of GL induces on V a special collineation called 
project! ve transformation of V. These transformations form a sub- 
group of PS denoted by PGL (projective general linear group), and 
the map of GL onto PGL is just the restriction of j to GL. We shall 
(rather sloppily) call it j again. The kernel consists of those X a 
which are homomorphisms. They must satisfy \ a (xX) = x\ a (X) 
or ax = xa for all x e k. This means that a comes from the center 
Z of k, hence from Z*, since a ? 0. We get the exact sequence 



Denote by I(k) the inner automorphisms of k. An element of 
PS determines an automorphism of k only up to an inner one so that 
we get only an element of the factor group A(k)/I(k) associated 
with each element of PS. Let us denote by p' this onto map 
PS -^ A(k)/I(k). Its kernel consists of the collineations which 
are induced by elements of S belonging to an inner automorphism. 
But since one can change the automorphism by any inner one we can 
assume that the element of S belongs to the identity automorphism, 
that it is in GL. This shows that PGL is the kernel of p' and another 
exact sequence may be written down. 

The situation will become less confusing if the reader studies the 
following diagram which incorporates all the statements we have 



CHAPTER II 93 

made and explains in full detail the consequences of the fundamental 
theorem of projective geometry in the case of collineations of the 
space V onto itself: 

1 
I 
PGL ->1 



PS ->1 

I*' 

- 1 



1 

1 


1 

1 


l- Z* 


-4 GL -i 


4-- 


4- 


l- k* 


.A s 


I" 


i' 


I - I(K) 


^ A(K) - 


1 

1 


I 

1 



I 

1 

The symbol "inj" indicates a mere injection map, "can" the 
canonical map onto the factor group. The map p associates with 
a t fc* the inner automorphism x > axcT 1 ; its kernel is clearly Z*. 

Every row and every column is an exact sequence. Every square 
of the diagram is "commutative"; this means for instance for the 
square whose left upper corner is S that the map j followed by 
p' gives the same result as p followed by can, i.e., p'j = can p. 

In later chapters we shall study subgroups F of GL. The effect 
which F has on the projective space V, therefore the image of F 
under j, will be denoted by PF. The kernel of this onto map F > PF 
consists of those \ a which belong to F. In all cases which we shall 
consider this kernel will be the center of F, and this will allow us to 
define PF as the factor group of F modulo its center. 

There is a geometric construction called projection (or perspec- 
tivity) which dominated the classical projective geometry and which 
will give us still a better understanding of the fundamental theorem 
of projective geometry. 

Assume that both spaces V and V are subspaces of a space ft 
of dimension N > dim V = dim V = n > 2; notice that we allow 
n = 2. Let T be a subspace of ft which is supplementary to V as 
well as to V: 

a = v T = v 9 r, v n T = v n T = o. 
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Let ,W be a subspace of Q which contains T and intersect W 
with F: W n F = C7. Clearly 7 + T 7 C TF but the reverse inclusion 
is also true. Any vector of 12 = V T has the form A = B + C 
with 7? e F and C e T. If A e W, then B =* A - C lies in TF and F, 
therefore in C7; consequently A e U + T and we obtain W = C7 + T. 
If we start with any subspace U of F and set W = C7 + T, then 
[7 C TF Pi 7; if A t W H F C TF, we can write A = J5 + C where 
J5 e 7 and C t T. The vector C = A - B lies in F as well as in T 
and, since V C\ T = 0, we get C = 0, A = B t U. 

For subspaces C7 of F and spaces W which contain T the two 
equations 

U = WC\V and TF=(7 + T 

are, therefore, equivalent. 

We start now with any U C F, form TF = ?7 + T 7 and intersect TF 
with V: U f = TF H F'. This gives a one-to-one correspondence 
between the subspaces 7 of F and the subspaces U' of V which is 
completely described by the one equation 

U + T = U' + T. 

Since U l C U 2 implies U{ C t^2 this one-to-one correspondence 
is a collineation of V onto F' which one calls projection of V onto 
V' with center !T. 

We allow also n = 2 and can, therefore, not make use of the 
fundamental theorem of projective geometry. We proceed directly. 
Let A l , A 2 , , A n be a basis of F and denote by (AJ) the image 
of the line (A.). Then (A,> + T = (A<> + T and this leads to a 
formula A t - = a t -AJ + B t where B t e T 7 ; since A { $ T the element 
ai is 7^ 0. We may replace AJ by the equivalent vector a.AJ , thereby 
simplifying the formula to A t = AJ + JB t - . This leads to x^A l + 
+ x n A n = XiA{ + - + x n A' n + C with C t T. If we denote 
by X the homomorphism (with respect to the identity automorphism 
of k) which sends A t onto A( , we can write \(X) X tT and obtain 

(X(X)> + T = <X> + T. 



This formula shows that the linear map X induces on F our pro- 
jection. The map X : F - V is not an arbitrary homomorphism. 
Should X e F Pi V then (X) + T = (X) + T (in different inter- 
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pretations) shows that (X) is kept fixed by the projection; X(X) X 
is, therefore, not only in T but also in V and V C\ T = implies 
X(X) = X, our map X keeps every vector of V H V fixed. 

Let us start conversely with a linear map X : V > 7' which is 
onto and which keeps every vector of V C\ V fixed. Denote by 
TO the set of all vectors of the form X(X) X where X ranges 
over V. It is obvious that T Q is a subspace of Q; its intersection with 
V is 0. Indeed, if X(X) - X e 7, then X(X) e X + V = 7; since 
X(X) e F' we have X(X) e F H 7' and consequently, X(X) = * 
showing that our vector X(X) X = 0. Similarly we can prove 

T Q r\ v = o. 

We have !F C 7' + 7 and, therefore, 7 + 7' C V + 7. If 
X(X) is any vector in 7', then 

X(X) = (X(X) - X) + X e To + 7. 

This shows 7 + 7' C 2 o + 7, and, therefore, 7 + 7' = T + V. 
Similarly V + T Q = 7 + 7'; since 7 H T 7' H 3T = we can 
write 7 + V = 7 T = 7' T Q . Let ^ be a subspace of fl 
which is supplementary to 7 + 7'. Set T = T T t . Then 

7 To T, = 7 T = Q = 7' T. 

We may project from T . Since X(X) - X e T C T we get <X> + T = 
(X(X)} + T, and this shows that our X will induce the projection 
from T. 

The condition that X(X) = X for all X e 7 n_7' has no direct 
geometric meaning in the projective spaces 7 and V where individual 
vectors of 12 do not appear. Let us see if we can replace it by a condi- 
tion which has a geometric meaning. The projection will leave every 
line of 7 n 7', i.e., every J^point" of 7 C\ V fixed. Suppose now 
that a collineation <r : 7 V is given which leaves every "point" of 
7 C\ V fixed. Does it come from a projection? Let us assume 
dim(7 H 7') > 2. Should n = 2, then this is only possible if 7 = 7'; 
our or must be identity and is clearly induced by the identity map X, 
i.e., a comes from a projection trivially. If n > 3, we may use the 
fundamental theorem of projective geometry. The collineation <r is 
induced by a semi-linear map X and this map X will keep every line 
of 7 n V fixed. For X t 7 C\ V we will get X(X) = aX and can 
prove that a is the same for all X e 7 C\ V as in the fundamental 
theorem of projective geometry by comparing independent vectors 
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of the space V C\V (whose dimension is > 2). We may now change 
the map X by the factor of 1 and the new map X will still induce a 
but satisfy \(X) = X for all X e V H V . Denote by /z the auto- 
morphism of k to which X belongs. For an X ?* of V H V we have 
\(aX) = a"\(X) = tfX on one hand and X(aX) = aX on the other. 
This proves that /z is the identity automorphism, i.e., that X is an 
ordinary homomorphism of V onto V. Since it keeps every vector 
of V C\ V fixed, our collineation is indeed a projection. We postpone 
for a while the discussion of the cases where dim(F C\ V) < 1. 

In classical synthetic projective geometry a collineation of two 
subspaces V and V of Q is called a projectivity if it can be obtained 
by a sequence of projections. Each of these projections (possibly 
onto other subspaces) is induced_by a linear map and consequently 
also the projectivity of V onto V. Let us consider first the special 
case of a projectivity of V onto V. It is induced by a linear map, 
therefore, by an element of GL(V) and is consequently an element 
of PGL(V). The question arises whether every element of PGL(V) 
is a projectivity. Let H be any hyperplane of V. Since N > n we 
can find a subspace V of 12 which intersects V in the hyperplane H. 
We can find a linear map of V onto V which is identity on H and 
carries a given vector A t V which is outside H onto a given vector 
B of V which is outside H. This map will come from a projection. 
We can then project back onto V and move B onto any given vector 
A' of V which is outside H. The combined map is a projectivity 
of V onto V which is induced by a map X : V > V such that X is 
identity on // and such that X(A) = A 1 . We can, for instance, 
achieve a "stretching" X(A) = a A of the vector A or we can also 
achieve X(A) = A + C where C is any vector of H. In the early 
part of Chapter IV we are going to see that every element of GL(V) 
can be obtained by a succession of maps of this type (with varying 
hyperplanes H). If we anticipate this result then we can say that 
the projectivities of V onto V are merely the elements of PGL(V). 
If V and V are different spaces we may first project V onto V in 
some fashion and follow it by ajyojectivity of V onto V, hence 
by an arbitrary element of PGL(V). It is clear nowjbhat any linear 
map of V onto V induces a projectivity of V onto V. This explains 
the dominant role played by the group PGL(V) in the synthetic 
approach. 

We return to the question which collineations of a space V onto 
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a space V are projections from a suitable center. One has certainly 
to assume that every "point" of V C\ V remains fixed and we have 
seen that this is enough if dim(F C\ V) > 2. In order to dispose 
of the remaining cases we have to strengthen the assumption. We 
shall assume that the given collineation cr is a projectivity, i.e., that 
it is obtained by a sequence of projections. This implies the simplifi- 
cation that a is induced by a linear map X of V onto V and, as we 
have seen, is equivalent with this statement. 

If V C\ V = 0, then no condition on X was necessary to construct 
the center T of the projection. This case is, therefore, settled in the 
affirmative. 

_There remains the case where dim(F C\ V) = 1, when V and 
V intersect in a point. Let (A) = V C\ V. Since <r(A) = (A) we 
must have A (A) = a A with a 9* 0. If, conversely, X is a linear map 
of V onto V for which A (A) = a A (and a can have any non-zero 
value in k for a suitable X) then X will induce a projectivity a- of V 
onto V which keeps (A) fixed. Once a is given, one has only a little 
freedom left in the choice of X. We may replace \(X) by \i(X) = 
\((3X) provided Xi is again a linear map. But this implies that ft is 
taken from the center of k. We must now see whether we can achieve 
\i(A) A for a suitable choice of ft. This leads to the equation 
fta. = 1 and we see that such a ft can only be found if a is in the 
center of k. Since a. might take on any value one has to assume that 
k is commutative. We gather our results in the language of projective 
geometry. 

THEOREM 2.27. Let & be a projective space, V and V^jproper 
subspaces of equal dimension and or a collineation of V onto V which 
leaves all points of V H V fixed. We can state: 

1) If dinip (V n V) > 1, then cr is a projection of V onto V 
from a suitable center T (<r is a perspectivity) . 

2) // V and V have an empty intersection and if cr is a projectivity, 
then cr is again a projection from a suitable center. 

3) // V and V intersect in a point and if & is a projectivity, then 
it is always a projection if the field k is commutative. If k is not com- 
mutative, then this will not always be true. __ 

The most important special case of this theorem, when J2 is a 
plane and 7, V distinct lines leads to thejfchird of our alternatives. 
A projectivity of the line V onto the line V which keeps the point 
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of intersection fixed is in general a projection only if the field is 
commutative. It depends on the configuration of Pappus. 

The question whether every collineation of a space V onto itself 
is a projectivity has a quite different answer. One must assume 
dim p V > 2 and now the problem reduces to PGL = PS; this is 
equivalent with A(fc) = I(k) by our diagram, every automorphism 
of k must be inner. 

In the classical case fc = R, the field of real numbers, both con- 
ditions are satisfied. The field R is commutative and identity is 
the only automorphism of R. 

Our last theorem in this paragraph concerns the characterisation 
of the identity. 

THEOREM 2.28. Let V be a protective space over a commutative 
field k. Let a be a projectivity of V onto V which leaves n + I points 
fixed (n = dim V) and assume no n of these points lie on a hyperplane. 
We assert that a is the identity. This theorem is false if k is not commuta- 
tive. 

Proof: Let (A ), (A^, , (A n ) be the n + 1 points. The vectors 
AI , A 2 , , A n are a basis of V and if A = 2<-i a <A< then a t ^ 0. 
These statements follow easily from the assumption about the points. 
If we replace A t by a, A t we obtain the simpler formula A = ]C"-i ^* 

The projectivity <r is induced by a linear map X and since a leaves 
our points fixed, we must have X(A t ) = /3,-A, and consequently 
X(A ) = )?-i faA-i But X(A ) = PoA whence & = &> , all fr, are 
equal. From X(A t ) = 0A t - we deduce for any vector X that \(X) = f$X 
since k is commutative. This was our contention. If k is not commuta- 
tive select a, ft e fc such that aft j (la. Let X be the linear map which 
sends A t - into A, (i = 1, , n). The vector A = )?-i ^- i g mapped 
onto A . The collineation a which X induces leaves the points 
(A,) fixed. The point (X) = (A! + 0A 2 ) is moved into (aA l + /taA 2 ) = 
(A! + a~ l ($aA 2 ) which is different from (X). 

11. The projectile plane 

Consider an affine Desarguian plane and let k be its field. Let V 
be a three dimensional left vector space over fc and W a plane of V. 
We shall construct a new Desarguian plane Tl w as follows: 

The "points" of n^ shall be those lines L of V which are not 
contained in W. 
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The "lines" of tt w shall be those planes U of V which are ^ W . 
The incidence relation shall merely be L C U. 

Let W = (A 2 , ^ 3 ) and V = (A, , A 2 , A 3 >. If L C W, then L is 
spanned by a unique vector ^! + %A 2 + rjA 3 and we shall associate 
( , ry) as coordinates to L. 

A "line" U will have an intersection (aA 2 + 0A 3 ) with W. U can 
be spanned by <xA 2 + #A 3 and by some vector ^Li + yA 2 + dA 3 . 
Since L ( TF, a "point" L on U is spanned by a unique vector of the 
form (Ai + yA 2 + dA 3 ) + t(aA 2 + f}A 3 ). The coordinates of L 
are, therefore, 

(ta + y,tp+ S) = *(,$ + (7, 5). 

This is the equation of a line in parametric form and the only restric- 
tion is that (a, /5) ~ (0, 0). We see that II ^ is coordinatized by the 
field k and that it has, therefore, the same structure as our original 
Desarguian plane. 

The plane II ^ is obtained from the projective plane V by deleting 
the "line" W and all the "points" on W. 

We thus have the following picture: If we take a projective plane 
V and eliminate one line and all the points on it, we get a Desarguian 
plane. All the Desarguian planes which can be obtained from V by 
this process have the "same" type of affine geometry. If two lines of V 
meet on a point of the deleted line W, then they do not meet in n^ and 
are, therefore, parallel. Each deleted point is characterized by a 




Figure 5 
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pencil ,of parallel lines of U w . We call them "infinite points" of 
U w and W itself the infinite line of II ^ . 

From the affine configurations we can get project! ve configurations 
and prove them by deleting appropriate lines. 

Figure 5 1 indicates a project! ve configuration. If we delete 
the line X it becomes the affine Desargues: The lines l v , 1 2 and the 
lines w x , m 2 become parallel so that the lines HI , n 2 are also parallel; 
they must, therefore, meet in a point on the line X. This configuration 
is known as the projective Desargues. It contains of course the 
affine Desargues as a special case if X is the "infinite" line. 



o 




A c B D 

Figure 6 

Figure 6 is another such configuration. Draw a line X through 
the point Q which does not go through A, B, C. If we delete the 
line X it becomes the construction of the fourth harmonic point D. 
The fact that the construction is independent in its affine form 
means that the point Q could be moved to any other point on X, 
hence to any point Q' in V such that the line through Q and Q' 
does not go through A, B, or C (provided of course it is outside I). 
By two successive moves of this type we can bring it into any position. 
Thus the point D is always the same no matter how P and Q are 
chosen. This fact is known as the theorem of the complete quadri- 
lateral. 

The reader may find for himself the projective Pappus. 

Any two distinct lines of V meet in a unique point and through 
distinct points of V there goes exactly one line. Each line of V 
contains at least three points. 

figure 5 is meant to be in a plane. For a later purpose it is drawn in such a way 
that it could be interpreted in space. 
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It is now clear on what axioms one would base projective geometry 
in a plane: 

1) Two distinct points determine a unique line on which they lie. 

2) Two distinct lines intersect in exactly one point. 

3) Each line contains at least 3 points and there are 3 non-collinear 
points. 

4) The projective Desargues. 

If V is a projective plane of this kind, then one deletes a line and 
all the points on it. One obtains an affine plane in which the affine 
Desargues holds. This affine plane can be coordinatized by a field fc. 
With this field k one can construct a projective plane of the same 
structure as the given one. It is to be remarked, however, that the 
field is not canonically constructed, that it involves the choice of 
the deleted line. A canonical field associated with the plane would 
have to be explained as an equivalence class of fields (one for each 
deleted line). 

We have never mentioned the axioms for affine or projective 
spaces of a dimension n > 2. It is approximately clear how these 
axioms would look. In the projective case the incidence axioms 
might be the rules connecting dimensions of intersection and sum, 
in the affine case the rules connecting the dimensions of intersection 
and join. We had given these rules in Chapter I. If n > 2 no Desargues 
is necessary, one can prove it from the incidence axioms. One look 
at Figure 5 suggests viewing the configuration as the projection of 
a three-dimensional configuration onto a plane. The three-dimensional 
configuration is easily proved. The reader should work out the details 
of such a proof. The theorem of Desargues is, therefore, the necessary 
and sufficient condition for a plane geometry to be extendable to a 
geometry in 3 (or more) dimensions. Indeed, one can prove Desargues 
if n > 2 and, on the other hand, coordinatize the plane if Desargues 
holds; but such a plane can clearly be imbedded in a higher dimen- 
sional space. This is again another geometric interpretation for 
Desargues. 

If one replaces Desargues' theorem by Pappus, then, as Hessen- 
berg has shown, Desargues can be implied from Pappus. It would 
be nice to have a proof of this fact in the style of our presentation. 

One may also think of replacing Desargues' theorem by the theo- 
rem of the complete quadrilateral. Very interesting results have been 
obtained which the reader may look up in the literature. 
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EXERCISES. We can of course also associate a projective space 
V with a right vector space. 

Let now V and V be two vector spaces over fields k respectively 
k' (V can be right and V either left or right, or V can be left and 
V either left or right). Assume they have both the same dimension. 
Let U > U r be a one-to-one correspondence (onto) of the subspaces 
of V and the subspaces of V. We shall call it 

a collineation of V and V if C/i C U 2 implies U( C U 2 , 
a correlation of V and V if Ui C U 2 implies U( D Ui . 

1) Let F be a left vector space over k. Find a right vector space 
V over some field k' such that there exists a collineation of V" onto 
T'. 

2) To each left vector space V there exists a canonically constructed 
right vector space V over the same field k together with a canonical 
correlation V > V. 

3) Use 1) and 2) to describe all collineations and correlations, 
making also use of the fundamental theorem of projective geometry. 

4) Suppose there exists a correlation V > V (the same space). 
What is the necessary and sufficient condition on the field k of 7? 
Consider the group of all correlations and collineations of V > V 
and investigate its structure. Draw diagrams. 

Let T be an antiautomorphism of k (if k is commutative every 
automorphism is also an antiautomorphism). Call a "product" 
V X V k a generalized pairing of V and V into k if the two dis- 
tributive laws hold and if in case V is a right vector space the 
following conditions are satisfied: 

X(Ya) = (XY)a, (Xd)Y = a r (XY). 

Making use of the dual V of V one can easily describe the laws 
governing such a pairing. 

5) Suppose a correlation V > V exists. Prove that there exists 
a generalized pairing of V and V into k which allows us to describe 
the correlation algebraically. Find all pairings describing a given 
correlation. 

6) Consider a correlation between two projective planes. Give 
the images of the configurations of Desargues, Pappus and the 
complete quadrilateral. Does this constitute a proof of the image 
configurations? 
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7) Let II and II' be two Desarguian affine planes, k respectively 
k' the fields of trace-preserving homomorphisms of the translation 
groups T respectively T'. Let X : II > n' be a one-to-one map of 
n onto II' which preserves lines. Prove: 

a) XTX" 1 = T'. 

b) For each a t k the map 



is a trace-preserving homomorphism of T 7 , hence a certain element 
of &' which we may denote by a* . 

c) The map X : k > k' given by a a* is an isomorphism of k 
onto k'. 

d) If coordinates are given in n then the coordinate system of II' 
can be placed in such a position that the image of the point (x, y) 
in II is the point (a;*, y*) of II'. 

e) Describe all such maps X by formulas. 

8) Using Figure 6 give a sequence of three projections which 
thereby produce a projectivity a of the line I through A, R onto itself 
with the following properties: A and B are fixed points; if C is distinct 
from A and B then its image under a is the fourth harmonic point D. 
What is the order of the projectivity <r? 

9) Let k be a commutative field of characteristic 5^ 2, V a vector 
space over k and V the corresponding projective space. Let be a 
projectivity of V onto itself which is of order 2 and has at least one 
fixed "point". Prove that among the linear maps of V onto itself 
which induce <r there is one call it X which is also of order 2. 
Show that V is the direct sum of two subspaces U and W such that 
X keeps every vector of U fixed and reverses every vector of W. 
The projectivity a- is completely described by the pair 17, W of 
subspaces (U, W equivalent to TT, U). 

10) Let k be commutative and of characteristic 7* 2. Show that 
a projectivity of a line I which is of order 2 and which has a fixed 
point is by necessity a map of the type described in 8). Avoid any 
computation. Returning to 9) describe the projectivity or by geo- 
metric constructions. Assume that k is the field of quaternions. 
Give a projectivity of a line which is of order 2 and is not of this 
type. 



CHAPTER III 
Symplectic and Orthogonal Geometry 

1. Metric structures on vector spaces 

This chapter presupposes a knowledge of Chapter I, 2-4. We 
will deal exclusively with vector spaces V of finite dimension n 
over a commutative field k. A left space V shall be made canonically 
into a two-sided space by the definition 

Xa aX , a e k, X e V. 

Since V has become a left space as well as a right space we can 
consider pairings of V and V into k. In such a pairing a product 
XY e k is defined for all X, Y e V and will now satisfy the rules 

XY, , 



X(aY) = a(XY) 

(the last rule since aY = Fa and since fc is commutative). 

The reader should think intuitively of X 2 = XX as something 
like "length" of the vector X and of XY as something related to 
the "angle" between X and Y. We shall say that such a product 
defines a "metric structure" on V and investigate first how such a 
structure can be described in terms of a basis A^ , A 2 , , ^l n of V. 

We set 

(3.2) AtA t = 0,,-e/c, i,j = 1,2, -- , n. 
Let 

(3.3) X = Z aU, , F = Z ffrA, 

F-l M-l 

be any two vectors. The rules (3.1) allow us to express their product: 

(3.4) XY = E g^y, , 

r.M-1 

which shows that we know XY if the g if are known. 

105 
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Let us select conversely arbitrary elements g fi in A; and define 
a product XY (of vectors X and F given by (3.3)) by means of (3.4). 
This product satisfies obviously the rules (3.1) and is, therefore, 
a pairing. If, in (3.4), we specialise X to A< and F to A f we again get 
(3.2). A function of the variables #, and y^ of the type (3.4) is called 
bilinear. Hence the study of bilinear functions is equivalent with a 
study of metric structures on V. 

For a given pairing the g if will depend on the choice of the basis. 
Let us see how a change of the basis will affect the g ti . Let B l , B 2 , 
, B n be a new basis of V\ then B f = ]C"-i A,a 9i with certain 
an c fc. The BJ will form a basis of V if and only if the determinant 
of the matrix (a,-,-) is 7* 0. With the new basis we construct the 
new (/7,: 



In matrix form this can be written as 

(3.5) (<7^) = (<*i<) 07<i)( a <f) 

where (a/,) is the transpose of the matrix (a.,-)- 

Our pairing will have a left kernel U Q and a right kernel F . 
They are now of course subspaces of the same space V. We know 
dim V/Uo = dim V/V which implies that U Q and V have the same 
dimension. Let us compute U in terms of our basis A 9 . A vector 
X = ;_, X > A * wil1 be in U if and only if XY = for all F. Then 
certainly XA,- = (j = 1, 2, , n). Conversely, if XA,- = for 
j = 1, 2, , n we get X- (2J-i Z/M^M) = 0. X will be in 7 if and 
only if 

(3.6) 0,,*, = 0, j-1,2, -.-.n. 

F-l 

We are especially interested in the question: when will U be merely 
the subspace of F? Since V has the same dimension this also 
implies that V Q = 0. The equations (3.6) should only have the trivial 
solution and this is the case if and only if the determinant of the 
matrix (</,-,) is different from 0. We shall use a certain terminology. 

DEFINITION 3.1. We call a vector space V with a metric structure 
non-singular if the kernels of the pairing are 0. The determinant 
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G = det(0,,) shall be called the discriminant of V. The necessary 
and sufficient condition for V to be non-singular is G 7* 0. 

Let us return to (3.5) and denote by G the determinant of (g^) 
(hence the discriminant of V as defined by the basis B w ). Taking 
determinants in (3.5) we obtain 



(3.7) <? = ( 

The geometric set-up alone determines, therefore, the discriminant 
of V only up to a square factor. 

DEFINITION 3.2. Let V and W be vector spaces over k, and <r: 
V > W be an isomorphism of V into W. Suppose that metric struc- 
tures are defined on both V and W. We shall call a an isometry of 
V into W if a "preserves products": 

XY = (<rX)(cry) for all X,YtV. 

The most important case is that of isometrics of V onto V. Then 
o-" 1 is also such an isometry and if <r and T are isometrics of V onto 
V so is or. 

DEFINITION 3.3. Let V be a space with a metric structure. 
The isometrics of V onto F form a group which we will call the group 
of V. 

Let o- be an endomorphism of V into V and A, a basis of V. Put 
aA, = B t ; if o- is an isometry, then we must have B t S, = A t A, 
(i, j = 1, 2, , ri). Suppose, conversely, that these equations hold. 
If X = E?! x,A v and F = E"-i U^^ then 



and 



We have 

n n 

r.M-1 *,M-1 

The map cr will be an isometry if we can show (or assume if necessary) 
that the kernel of <r is 0. If V is non-singular this follows auto- 
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matically. Indeed, if <rX = 0, then certainly aX-aY = for all 
Y e V. But <rX<rY = XY so that XY = for all Y e V. This means 
that X is in the left kernel U and therefore 0, if we assume V to be 
non-singular. 

THEOREM 3.1. Let <r be an endomorphism V -* V, A t a basis 
of V and aA t = 5, ; <r will be an isometry if and only if AiA f = #,-#, 
for all i, j ~ 1, 2, , n and if the kernel of <r is 0. This last condition 
is unnecessary if V is non-singular. 

If we write B, = ]>3?-i A,a vi , then the matrix (a,-,-) is the one 
describing the endomorphism a in the sense of Chapter I, 3. Since 
B t B f = ~g7j = QH for an isometry, we get from (3.5) and (3.7) 



(3.8) 

(3.9) G = (det(a,,.)) 2 -(? = (det a) 2 -(?. 

Should F be non-singular, then G 7* and we get det <r = 1. 

THEOREM 3.2. // V is non-singular and o- an isometry of V onto 
V, then det <r = 1. // det o- = +1, then is called a rotation; if 
det o- = 1, /Aen w caM a a reflexion. The rotations form an invariant 
subgroup of the group of V whose index is at most 2. 

The last part follows from the fact that the map a > det <r is a 
homomorphism of the group of V whose kernel are the rotations 
and whose image is either 1 or just +1 (in case there are no 
reflections or if the characteristic of ft is 2). 

If we form X 2 = X-X we find from (3.4) 

(3.10) X 2 = E gvXM . 

<,'-! 

This is an expression which depends quadratically on the x, and 
is, therefore, called a quadratic form. The coefficient of x? is g i{ , 
the coefficient of x^x,- is g if + g fi if i ^ j. We notice that we can 
select the g if in such a way that X 2 becomes a given quadratic 
form; it is even possible in several ways. 

One can define a quadratic form intrinsically (without reference 
to a basis) as a map Q of V into k (but not a linear one) which satisfies 
the two conditions: 



1) Q(aX) = a 2 Q(X), a t k, XeV. 

2) The function of two variables X, Y e V given by 
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Q(X + Y) - Q(X) - Q(7), 

which we shall denote by X o Y, is a pairing of V and F into k. 

Indeed, if we start with an arbitrary pairing XY and define 

Q(X) = X 2 , then this Q(X) satisfies condition 1). For X o Y one finds 

X o Y = (X + 7) 2 - X 2 - F 2 = XY + YX. 

It will satisfy condition 2). We remark, however, that X o F is not 
the original pairing. 

Suppose now, conversely, that Q(X) satisfies the two conditions. 
If we put X = X l + X 2 + + X^ and Y = X r in condition 2) 
we find 

Q(X l + X 2 + + X r _0 



If we use induction on r we get 
+ X, + + X r ) = 



Let Aj , A 2 , , A n be a basis of V and let X 
Then 

Q(X) = 

i 

This shows that Q(X) depends indeed quadratically on the z< . 
The pairing 

(3.11) X o Y = Q(X + Y) - Q(X) - Q(F), 

which we get from a quadratic form, is very special. If we put Y = X 
in (3.11) and notice that Q(2X) = 4Q(Z) we obtain 

(3.12) XoX = 2Q(X). 
Equation (3.11) shows also that 

(3.13) X o Y = Y o X. 

We distinguish now two cases: 

a) The characteristic of k is ^ 2. Then we can write Q(X) = 
o X) and see that the quadratic form differs inessentially from 
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X o X. The pairing X o Y is symmetric. Can there be another 
symmetric pairing XY = YX for which we also have Q(X) = $X-X1 
We would then get 

Q(X + 7) - Q(X) - Q(Y) 

= j(x + Y)(X + 7) - jo: - jyy = x- Y 

which shows X-F = X oY. 

The quadratic form Q(X) determines, therefore, uniquely a 
symmetric pairing such that 

X-X = 2Q(X). 

If our aim is the study of quadratic forms we may just as well 
start with a symmetric pairing and call X 2 the corresponding quadratic 
form since it differs from Q(X) by the inessential constant factor 2. 

b) The situation is very different if the characteristic of k is 2. 
Equation (3.12) becomes X o X = 0. If we start with a symmetric 
pairing XY = YX and hope to obtain Q(X) from X 2 we do not 
succeed. Indeed, if X = ]C<-i ^^ > then 



since the two terms A ,4 ,,, and A J A t x J x l (for i 7* j) will cancel 
if the characteristic is two. One can for instance never obtain a 
term x& 2 . To start with an unsymmetric pairing is not a desirable 
procedure since such a pairing is not uniquely determined by the 
quadratic form. 

2. Definitions of symplectic and orthogonal geometry 

Suppose again that an arbitrary pairing of V and V into k is 
given. As in any pairing we shall call a vector A orthogonal to a 
vector B if AB = 0. This raises immediately the problem: In which 
metric structures does AB = imply that BA = 0? Let us suppose 
that V has this property and let A, B, C t V. Remembering that k 
is commutative we get 

A((AC)B - (AB)C) = (AC)(AB) - (AB)(AC) = 0. 

Hence 

((AC)B - (AE)C)A = 
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or 

(3.14) (AC)(BA) = (CA)(AB). 

For C = A we obtain A 2 -(BA) = A 2 -(AJ3). Should A 2 5* 
then we can conclude JBA = AB. In other words: 

If AB 7* BA then A 2 = and similarly B 2 = 0. 

Suppose now that V contains two special vectors A and B such 
that AB j BA. We intend to show that C 2 = for any vector C. 
This would certainly be true if AC ^ CA so that we may assume 
AC = CA. But (3.14) is then consistent with AB 9* BA only if 
AC = CA = 0. We may interchange A and B and see that we can 
also assume BC = CB = 0. But now 

(A + C)B = AB ^ BA = B(A + Q. 

Consequently (A + C) 2 = 0; since A 2 = (from AB 7* BA) and 
AC = CA = we get indeed C 2 = 0. We see that there are two types 
of metric structures with our property: 

1) The symplectic geometry. 
Here we postulate 

(3.15) X 2 = for all X e V. 

Replacing the vector X by X + Y we get X 2 + XY + YX + Y 2 = 0, 
hence 

(3.16) XY = -YX, X,Y* V. 

Equation (3.16) shows that AB = implies indeed BA = 0. 

From condition (3.16) one can not get (3.15) as the special case 
V = X, since k may have characteristic 2. 

For the g tj we obtain in a symplectic geometry 

(3.17) g^ = 0, g tj = -g ti . 

A bilinear form 5^?./.i g ti x i y j for which (3.17) holds is called skew 
symmetric. If (3.17) is satisfied, X 2 = ]C?.f-i 9n x * x i == which 
shows that (3.17) is the necessary and sufficient condition for a 
symplectic geometry and that such a geometry is equivalent with 
the study of skew symmetric bilinear forms. 

2) The orthogonal geometry. 

If V is not symplectic but if it has our property, then, by necessity, 

(3.18) XY - YX for all X, Yt V. 
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This is a symmetric pairing which we have also met a while ago in 
connection with quadratic forms. In this case AB = implies again 
BA = 0. 

If the characteristic of k is ^ 2, this geometry is entirely satisfactory 
since the symmetric pairings are in one-to-one correspondence with 
quadratic forms and one may simply say that X 2 is the quadratic 
form connected with our pairing. 

If, however, k has characteristic 2, then the symmetric pairings 
are not general enough. In this case one starts with any quadratic 
form Q(X) and defines a pairing by (3.11). But let us simplify the 
notation and write this pairing as XY. Hence 

(3.19) XY = Q(X + F) - Q(X) - Q(F). 
Equation (3.12) shows that 

(3.20) X 2 = 0. 

The underlying pairing is, therefore, symplectic (if the charac- 
teristic is 2, then there is no difference between (3.18) and (3.16)). 
In intuitive geometric language: V has a symplectic geometry 
refined by an additional quadratic form (measuring lengths if you 
wish) which is related to the symplectic geometry by (3.19). The 
geometry of V is called in both cases an orthogonal geometry. 

To repeat the connections with the quadratic form: (X + F) 2 = 
X 2 + Y 2 + 2XY if the characteristic ^ 2, Q(X + F) = Q(X) + 
Q(Y) + XY for characteristic 2. One may call these two formulas 
the Pythagorean theorem or also the "law of cosine' ' which they 
become in special cases. 

But we have to restrict ourselves. For the sake of simplicity 
we shall always assume tacitly that the field has a characteristic 
T^ 2 if we deal with an orthogonal geometry. In the case of a symplectic 
geometry we shall not make any restrictions. 

The interested reader may consult the quoted literature for more 
details on the case of an orthogonal geometry when the characteristic 
is 2. 

Appendix for readers who have worked on some exercises 
in Chapter II, 11. 

If fc is not necessarily commutative but has an antiautomorphism 
r, we may consider a generalized pairing of a right vector space 
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V with itself into fc and investigate the corresponding problem: 
Suppose XY = implies always YX = 0. Since the antiautomor- 
phism and the pairing are involved the discussion is a little harder. 
Assume dim V > 2 and consider first the case where the kernel of 
the pairing is 0. 

Let A, B, C e 7 such that AB 5* and AC ^ 0. From 

A(B-(AB)~ l - C-(AC)- 1 ) = 
we get 

(B-(AB)- 1 - C-(AC)-')A = 
which leads to 

(AB)~ r (BA) = (ACr r (CA) = a. 

We may write (BA) = (AB) r a where a depends only on A; for 
vectors B for which AB = BA = this equation is trivially true. 
One has now to show that a is independent of A. Let A l and A 2 be 
two independent vectors. There exists a vector B 1 which is orthogonal 
to A 2 but not to A! ; i.e., A l B l ? and A 2 B 1 = 0. Changing B l 
by a factor we may assume AiBi = 1, A 2 B l = 0. There exists also 
a vector B 2 such that A ,B 2 = 0, A 2 B 2 = 1. The vector B = BI + B 2 
satisfies AJZ = 1, A 2 # = 1. We know three equations: 

BA, = (A.BYa, , BA 2 = (A 2 J5) T 2 , 



2 



They give JSAj = ^ , BA 2 = 2 , i 2 = 0. Should A 1 and A 
be dependent vectors ?^ 0, then we compare them to a third inde- 
pendent vector and conclude that a is the same for all A 7* 0. For 
A = it does not matter what a we take. The result is: 

(YX) = (XYYa for all X, Y e V. 

Case I. X 2 = for all X e V. We deduce then, as in the symplectic 
case, YX = -(XY) and obtain -(XY) = (XY) T a. Selecting 
X and Y in such a way that XY = I we obtain a = 1, and are left 
with (XY) = (XY) T . But XY can range over all of k and we see 
that r=l, the identity. The identity can be an antiautomorphism 
only if the field fc is commutative. This means that our geometry 
is symplectic. 

Case II. There exists a special vector X Q such that Xl = 5^ 0. 
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Set X =s Y = -X" in our equation and we obtain a = (T T 0. Let us 
introduce a new pairing [XY] = pT l (XY). It differs very inessentially 
from the given pairing (just by a constant factor). We get 

0[YX] = (P(XY]yp- T = [XYYfi, 

[YX] = p- l [XY]'p. 
We have 

[(Ya)X] = P~\(Ya)X) = (T l a'(YX) = pT l a^[YX] 

which shows that the new pairing belongs to the antiautomorphism 
a > /TV/3 = a\ With X we can now write 

[YX] = [ZF]\ 

Interchanging once more: [YX] = [FJ?] X9 which implies that X 2 = 1, 
the identity. 

Making this replacement in our pairing we may, therefore, assume 
that r is an antiautomorphism of order at most 2 and that 

(YX) = (XY) r . 

If r = 1, then the field must be commutative and we are in the 
case of orthogonal geometry. 

If r is of order 2, then k may or may not be commutative. The 
corresponding geometry is called unitary and the form X 2 is called 
a hermitian form. 

If the pairing has a kernel V Q 7* 0, one has to assume dim V/V > 2 
and obtains the same result by going to the pairing induced on V/V . 

EXERCISE. Each of our geometries with kernel induces on the 
projective space V a correlation of order 2 and each correlation of 
order 2 is induced by one of our geometries. By what geometric 
feature is the symplectic correlation distinguished from the others? 
Is the name "symplectic" somewhat justified? Suppose the geometry 
is either unitary or orthogonal. What does it mean for a special 
vector to satisfy X 2 = in terms of the correlation? This leads to 
the classical definition of a conic in projective geometry. 

3. Common features of orthogonal and symplectic geometry 

In this paragraph we will study spaces with either an orthogonal 
or a symplectic geometry. In either case orthogonality of vectors 
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or of subspaces is unambiguously defined. If U is a subspace of F, 
then the orthogonal subspace U* has a unique meaning. The two 
kernels of the pairing are the same, they are the space V*. 

DEFINITION 3.4. The kernel V* of V is called the radical of V 
and denoted by rad V. 

If U is a, subspace of V, then the pairing of V induces by restriction 
a pairing of U which will be of the same type as the one of V orthog- 
onal or sympletic. U itself has a radical consisting of those vectors 
of U* which are in U. In other words 

(3.21) rad U = U C\ U*, U C V. 
DEFINITION 3.5. If V is the direct sum 

(3.22) V = U, C/2 - - U r 

of subspaces which are mutually orthogonal, then we shall say that 
V is the orthogonal sum of the Ui and use the symbol 

(3.23) V = U, U 2 J_ - - - _L U r . 

Let V be a vector space which is a direct sum (3.22) of subspaces 
Ui . Suppose that a geometric structure is given to each subspace. 
Then there is a unique way to extend these structures to one of V 
such that V becomes the orthogonal sum of the U % . 

If 



are vectors of V and A t - , B t - e Ui ; , then we have obviously to define 
(3.24) XY = A,B, + A 2 B 2 + - + A T B, . 

The reader will have no difficulty in proving that (3.24) defines a 
pairing on V and that V will have a symplectic, respectively, ortho- 
gonal geometry if all the Ui are symplectic, respectively, orthogonal. 
The geometry of V induces on each C7 t - its original geometry and 
Ui and U f are orthogonal if i 9* j. 

Suppose V = t/i + U* + + U r , U< orthogonal to U, if 
i j& j 9 but do not assume that the sum is direct. Let X = ]T]7-i A, , 
Ai * Ui , and assume X e rad V. Then we must have XBi = for all 
Bi e Ui which gives A t Bi = or A t e rad Ui . Conversely, if each 
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Ai t rad C7 t , then X t rad V. In other words, if the (7, are mutually 
orthogonal, then, 

(3.25) raxl V = rad U l + rad U 2 + - + rad U r . 

Should each U t be non-singular, i.e., rad U t = 0, then we obtain 
rad V = 0, V is non-singular. But in this case our sum is direct. 
Indeed if ]C"-i ^ = we obtain AiB t = for any B % e [/, . Hence 
A t t rad U t = 0. We can, therefore, write (3.23) if each C7, is non- 
singular. 

Consider the subspace rad V of V and let U be a supplementary 
subspace 

V = rad V U; 
rad V is orthogonal to V and, therefore, to E7 and we get 

(3.26) V = rad V U. 
We deduce 

rad V = rad(rad 7) JL rad C7 = rad V J_ rad U. 

Since this last sum is direct we must have rad U = 0. U is, therefore, 
non-singular. 

The geometry on V does not in general induce a geometry on a 
factor space. It does so, however, for the space F/rad V. It is natural 
to define as product of the coset(X + rad F) and the coset(F + rad V) 
the element XY: 

(3.27) (X + rad F)-(F + rad 7) = XY. 

In Theorem 1.3 we have mapped a supplementary space U isomor- 
phically onto the factor space F/rad V. The map was canonical 
and sent the vector X of U onto the coset X + rad V. The equation 
(3.27) means that this map is an isometry of U onto 7/rad V. Thus 
we have proved 

THEOREM 3.3. Each space U which is supplementary to rad V 
gives rise to an orthogonal splitting (3.26). U is non-singular and 
canonically isometric to F/rad F. 

DEFINITION 3.6. Let F = ZA JL U 2 J_ - - JL U r , V = U{ _L 
Ui JL J- Ur be orthogonal splittings of two spaces F and V 
and suppose that an isometry v t of C7, into U( is given for each i. If 



CHAPTER III 117 

X = ]5".i At with A* e C7, is a vector of V, then we can define a 
map <r of F into V by 

(3.29) <rX = *!/!, + <T 2 4 2 + - + (T r A r 

which is an isometry and shall be denoted by 
(3.30) <r = eri J. <r, _L JL r r . 

We shall call it the orthogonal sum of the maps <r, . 

The reader will, of course, have to check that a is an isomorphism 
of V into V and that scalar products are preserved. 



THEOREM 3.4. Let V = Ui J_ t7 2 _L f/ r and eacA o\ an 
isometry E7, on/0 C7 t . TAen /&e orthogonal sum 

a = 0*1 _L 02 JL J. cr r 
is an isometry of V onto V and we have 

(3.31) det a- = det v l det o- 2 ..... det a r . 

if 

T = TI JL T 2 JL ' ' JL T r 

where the r { are also isometries of [/, onto Ui then 

(3.32) (TT = VI-TI J_ o- 2 r 2 JL 1. o- r r f . 



The proof of this theorem is straight forward and can be left to 
the reader. 

THEOREM 3.5. Suppose that V is non-singular and let U be any 
subspace of V. We have always 

(3.33) 17** - U, dim 17 + dim C7* = dim F, 

(3.34) rad U = rad C7* = U H U*. 

The subspace U will be non-singular if and only if U* is non-singular. 
Should U be non-singular, then we have 

(3.35) V = U C7*. 

Finally, ifV= U J_ W, then U and W are non-singular and W = C7*. 

Proof: Since the kernels of our pairing are 0, we have (3.33) from 

the general theory of pairings. Formulas (3.33) and (3.21) imply 

(3.34). If U is non-singular, then (3.34) shows that U* is non-singular 
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and that the sum U + C7* is direct; since the dimensions fit, we get 
(3.35). If V = U TF, then W C U* and dim W = n - dim U - 
dim t/*; therefore W = U* and rad U = U H C7* = 0. 

DEFINITION 3.7. A space is called isotropic if all products be- 
tween vectors of the space are 0. The zero subspace of a space and 
the radical of a space are examples of isotropic subspaces. A vector 
A is called isotropic if A 2 = 0. In a symplectic geometry every 
vector is isotropic. 

THEOREM 3.6. Let V be a space with orthogonal geometry and 
suppose that every vector of V is isotropic. Then V is isotropic. 

Proof: Under our assumption this geometry is symplectic as well 
as orthogonal. We have, therefore, XY = -YX = YX. This 
implies XY = since we assume, in case of an orthogonal geometry, 
that the characteristic of k is ?* 2. 

The following special case plays an important role in the general 
theory. 

We assume that dim V = 2, that V is non-singular, but that V 
contains an isotropic vector N ?& 0. 

If A is any vector which is not contained in the line (N), then 
V = (AT, A). We shall try to determine another isotropic vector M 
such that NM = 1. Put M = xN + yA, then NM = yNA. If NA 
were 0, then N e rad V\ but we have assumed V to be non-singular. 
Therefore NA 7^ and we can determine y uniquely so that NM = 1. 
In the symplectic case M 2 = is automatically satisfied so that 
any x is possible. If V has orthogonal geometry, x must be determined 
from 

M 2 = = 2xyNA + y*A*. 

This is also possible since 2yNA 7* and leads to a uniquely 
determined x. 

We have now for both geometries 

V = (N, M), N 2 = M 2 = 0, NM = 1. 

Conversely, if V = (N, M) is a plane, we may impose on it a geometry 
by setting n = g 22 = and 12 = 1 (hence 21 = 1 in the orthogonal, 
21 = 1 in the symplectic case). Then N 2 = M' 2 = and NM = 1. 
If X = xN + yM e rad V, then XM = 0; hence x = and NX = 
which gives y = 0. V is non-singular. 
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Suppose V = (Af, M) has an orthogonal geometry. X = xN + yM 
will he isotropic if X 2 = 2xy = 0; hence either y = 0, X = xAT or 
x = 0, X = yM. 

DEFINITION 3.8. A non-singular plane which contains an isotropic 
vector shall be called a hyperbolic plane. It can always be spanned 
by a pair N, M of vectors which satisfy 

N 2 = M 2 = 0, NM = 1. 

We shall call any such ordered pair N, M a hyperbolic pair. If V 
is a non-singular plane with orthogonal geometry and N ^ is an 
isotropic vector of V, then there exists precisely one M in V such 
that AT, M is a hyperbolic pair. The vectors xN and z/Af are then the 
only isotropic vectors of V. 

DEFINITION 3.9. An orthogonal sum of hyperbolic planes 
PI > 1*2 9 , P r shall be called a hyperbolic space: 

H 2r = P l JL P 2 -L - - _L P r . 

It is non-singular and of course of even dimension 2r. 

We may call a space irreducible if it can not be written as an 
orthogonal sum of proper subspaces. Because of (3.26) we see that 
an irreducible space is necessarily either non-singular or isotropic. 
If it is isotropic, then its dimension is 1 since any direct splitting 
of an isotropic space is also an orthogonal splitting. To discuss the 
non-singular case we distinguish: 

1) Orthogonal geometry. Because of Theorem 3.6, V must contain 
a non-isotropic vector A. The subspace U = (A) is non-singular 
and (3.35) shows U* = 0, dim V = 1. 

2) Sympledic geometry. Let N 7* be any vector of V. Since 
rad 7 = 0, there exists an A t V such that NA 7* 0. The plane 
U = (AT, A) is non-singular and (3.35) shows again U* = 0, V = U. 

THEOREM 3.7. A space with orthogonal geometry is an orthogonal 
sum of lines 

V = (A,) JL (A,) JL . - - JL <4 n >. 

The A, are called an orthogonal basis of V. V is non-singular if and 
only if none of the 4 t - are isotropic. 

A non-singular symplectic space is an orthogonal sum of hyperbolic 
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planes, in other words it is a hyperbolic space. Its dimension is always 
even: 

THEOREM 3.8. Let V be non-singular and U any subspace of V. 
Write U = rad U _L W and let N, , N 2 , ,N r bea basis of rad U. 
Then we can find, in V, vectors MI , M 2 , , M r such that each 
Ni , Mi is a hyperbolic pair and such that the hyperbolic planes 
P { = (N % , M % ) are mutually orthogonal and also orthogonal to W. 
V will therefore contain the non-singular space 

U = P! J_ P 2 J. - - J. P r W 

which, in turn, contains U. Let finally v be any isometry of U into 
some non-singular space V f . Then we can extend a to an isometry a 
of U into V. 
Proof: 

1) For r = nothing is to be proved. We may, therefore, use induc- 
tion on r. The subspace 

U = <tf t , N 2 , - , AT,.,) _L W 
is orthogonal to N r but does not contain N r . We have 

rad t/* = rad U Q = (N, , N 2 , - - , N r ^). 

This implies N r c U* but N r 4 rad U * . The space U% must, therefore, 
contain a vector A such that N r A ^ 0. The plane (N r , A) is non- 
singular and contained in U* . It is spanned by a hyperbolic pair 
N r , M r . Set P r = (N r , Af r ). Since P r C U * , our space (/ is ortho- 
gonal to P r which shows that C7 is contained in the non-singular 
space P* . The radical of U Q has only dimension r 1. By induction 
hypothesis (used on P* as the whole space) we can find hyperbolic 
pairs N % , M, in P* (for i < r 1) such that the P % are mutually 
orthogonal and also orthogonal to W. Since they are orthogonal 
to P r and since P r is also orthogonal to W we have finished the 
construction of U. 

2) Let a be the isometry of U into V, N'< = <rN t and W = <rW. 
Then <rU = (Nl , Ni , , N' r ) JL W. We can, therefore, find in V 
vectors M i such that ATJ , M( is a hyperbolic pair and the planes 
p r . = (N( , M() are mutually orthogonal and orthogonal to W. The 
map (T of U into V is now extended to a map 6- of U into V by pre- 
scribing irMi = Mi ; & is clearly an isometry. 
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THEOREM 3.9. (Witt's theorem). Let V and V be non-singular 
spaces which are isometric under some isometry p. Let <r be an isometry 
of a subspace U of V into V. Then a can be extended to an isometry 
of V onto V. 

Proof :_ 

1) If U is the non-singular subspace of V which was constructed 
in Theorem 3.8, then a can be extended to an isometry of U into V. 
This reduces the problem to the case where U is non-singular, an 
assumption which we shall make from now on. 

2) For a symplectic space V let U' be the image of U under <r 
and write V = U JL U* and V = U' _L U'*. All we have really 
to do is to show that U* and U'* are isometric. But this is clear. 
V and V have the same dimension since the isometry p exists. 
7* and U'* have, therefore, also the same dimension and they are 
non-singular; by Theorem 3.7 they are hyperbolic (we are in the 
symplectic case) and, hence, clearly isometric. We may assume from 
now on that our geometry is orthogonal. 

3) Should our theorem be true for non-singular subspaces of 
lower dimension, then we can proceed as follows: 

Assume that U = t/i J_ U 2 where U\ and U 2 are proper subspaces 
of U. We shall denote by U{ the image of Ui under <r and by r 
the map U\ > U{ which <r induces by restriction. The map of C/i 
into V which o- induces can be extended to all of V by induction 
hypothesis; the space U* will, thereby, be mapped onto the space 
J7i*. This means that U* and U{* are isometric. Let us return to our 
<r. It will map the space U 2 (which is orthogonal to Ui) into the 
space C/i*; this mapping can (by induction hypothesis) be extended 
to an isometry X of U* onto /{* arid now we are finished since 
r J_ X is the desired extension. 

4) We have still to show that the theorem is true if U is non- 
singular and irreducible. U = (A) is then a line and A 2 5* 0. For the 
image C = <rA we have, therefore, C 2 = A 2 . We must now remember 
that an isometry p of V onto V was also given so that this vector 
C will be the image p(B) of a vector B of V. We will know A 2 = B 2 . 
If we can find an isometry r of V onto V which carries A into B, 
then pr will be an isometry of V onto V which maps A onto C. To 
construct this T we proceed as follows. 

Since (A + B)(A - B) = the vectors A + B and A - B are 
orthogonal. They can not both be isotropic since 2A = (A + B) + 
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(A E) is not isotropic. Denote by D = A + eB (e = 1) a non- 
isotropic one of the vectors A B. By what we have seen A eB 
belongs to the hyperplane H = (A + cB)* = {/})* Let n be the 
isometry which is identy on // and sends D into D (/* is an ortho- 
gonal sum of isometries on (D) and (Z)}*). Then 

e5) = -A - eB, 
#) = A - eB. 

Adding the two equations we get juA = eB (k is not of charac- 
teristic 2 in the orthogonal case). 

If e = 1 we are done. If e = +1, then pA = B. Let v be the 
map which sends every vector X of V onto -3T. It is obviously an 
isometry of V and v^A B. This finishes the proof of our theorem. 

Theorem 3.9 has many important consequences. An isotropic 
subspace U is naturally called maximal isotropic if U is not a proper 
subspace of some isotropic subspace of V. 

THEOREM 3.10. All maximal isotropic subspaces of a non-singular 
space V have the same dimension r. This invariant r of V is called the 
index of V. 

Proof: Let Ui and U 2 be maximal isotropic and suppose dim Ui < 
dim C7 2 . We can find a linear map of f/i into U 2 which is an isomor- 
phism into. Since both f/ t and U 2 are isotropic such a map is trivially 
an isometry. By Theorem 3.9 it can be extended to an isometry a 
of V onto V. Thus vU l C U 2 and consequently Ui C <r~ l U 2 . The 
space <r" l f/ 2 is again isotropic. Since U l is maximal isotropic we 
must have U l = v~ l U 2 which shows dim C/i = dim U 2 . 

THEOREM 3.11. The dimension of a maximal hyperbolic subspace 
is 2r where r is the index of V. Therefore 2r < n = dim V and the 
maximal value %n for the index r is reached if and only if V itself is 
hyperbolic (for instance in the symplectic case). A hyperbolic subspace 
7/2* is maximal hyperbolic if and only if the space II*, does not contain 
any non-zero isotropic vectors. One achieves, therefore, a splitting 
V = H 2r JL W where W may be called anisotropic. The geometry of 
V determines uniquely the geometry of W (indepently of the choice 
ofH 2r ). 

Proof: 

1) If U is a maximal isotropic subspace of V and r = dim [/, then 
Theorem 3.8 shows the existence of a hyperbolic space H 2r of dimen- 
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sion 2r. Let W = 7/ 2 * so that we have V = # 2r 1 TF. Should TF 
contain an isotropic vector AT 5^ 0, then N would be orthogonal 
to 7/ 2r , hence to the subspace U 7 and not be contained in U. The 
space U JL (N) would be isotropic, contradicting the fact that U 
was maximal isotropic. 

2) If H 2t is any hyperbolic subspace of F, then H 2 . has the form 
(Nl , M() JL(Ni,MQ ... (N: , Mi) and (Ni , tf J , , tf 
is an isotropic subspace of dimension s. This shows s < r. We can 
write H 2r = H 2a JL // 2 <r-o where H 2a and H 2(r _ a} are hyperbolic of 
dimensions 2s respectively 2(r s). There exists an isometry a of 
F onto V which carries H 2a onto H 2a . Let H 2(r - a) and W be the 
images of // 2 (r-.> respectively TF under <r. Then 



and we see that H 2 ( r -.) J_ TF ; is the space orthogonal to H 2t . If 
s < r, then this space will contain isotropic vectors; if s = r, then 
it reduces to W which is isometric to W. 

THEOREM 3.12. Let V be non-singular. If Ui and U 2 are isometric 
svbspaces of F, then U* and U* are isometric. 

Proof: There is an isometry of F onto F which carries Ui onto U 2 . 
It carries, therefore, U* onto U* 2 . 

For several later applications we shall need a lemma about isome- 
trics of hyperbolic spaces. 

THEOREM 3.13. Let V = (N, , M,) JL <# 2 , M 2 ) J_ - JL (N r , M r ) 
be a hyperbolic space and a an isometry of V onto V which keeps each 
vector N, fixed. Then a is a rotation. As a matter of fact <r is of the 
following form: 



aAT t = N. , aM, = E N v a,, + M, 

-i 

where the matrix (a,/) is symmetric if the geometry is symplectic and 
skew symmetric if it is orthogonal. 

Proof: We have of course <rN { = JV, . Let 



We find 

= b tl 
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and, since we must have aN^aMj = N t M,- if a- is an isometry, we 
get 6',,- = 1 and ft,, = for i 7* j. Thus 

*M, = Z AU,, + Mi . 

v-\ 

We mast still check that rAf.-^lf, = .Tf./lf,. = 0. This leads to 
iXa,, + M.YE tf A/ + M,) = o,, + (M.ATX, = 0. 

,.l /\M-1 ' 

If F is symplectic, then M % N t = 1 and we get a,- t = a t ,- . If F 
is orthogonal, then a,, = a,,- and since we assume in this case 
that the characteristic of k is ^ 2 we see that (a,-,) is skew symmetric. 

Obviously, in both cases, det a = +1. 

We shall now turn to isometrics of general non-singular spaces. 
The identity shall be denoted by 1 or by l v if a reference to the 
space is needed. The map a which sends each vector X onto X 
satisfies (I + o-)X = X X = 0, 1 + <r = and shall, therefore, 
be denoted by 1 respectively 1 F if a reference to the space V 
is needed. 

Let V = U JL W. Then we can form the isometry o- = lu JL l w 
It is obviously of interest only if the characteristic of k is 5^ 2 since 
otherwise it would be merely l v . If X = A + B with A t U and 
B e W", then aX = - A + B. We have <rX = .Y if and only if A =0, 
or X c T7, and o-X = X if and only if B = or X t U. This means 
that U and W are characterised by or. Such a map o- satisfies obviously 
a 2 1. We shall now prove 

THEOREM 3.14. An isometry of a non-singular space V is called 
an involution if a 2 = I. If the characteristic of k is ^ 2 then every 
involution is of the form \u JL lw (resulting from a splitting V = 
U _L W}. If the characteristic of k is 2, each involution of V will keep 
every vector of a certain maximal isotropic subspace of V fixed and 
be of the type of isometries discussed in Theorem 3.13. 

Proof: 

1) Suppose o- 2 = 1. Then XY = aX-vY and vX-Y = SX-aY = 
X-vY. Consequently 

(*X - X)(<rY + 7) = trX-o-F + <rX- F - X-o-F - XY = 0. 

The two subspaces U = (a- 1)F and TF = (o- + 1)F are, therefore, 
orthogonal. 
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2) Let the characteristic of k be 9* 2. A vector <rX X of U is 
reversed by o-, a vector <rX + X of W is preserved by cr. Consequently 
U C\ W = 0. Since any vector X can be written as %((?X X) + 
%(vX + X) we see that V = C7 TT and o- = -! J. 1* . 

3) If the characteristic of k is 2 we are in the symplectic case. 
The two maps a 1 and <r + 1 are the same and 1) means now that 
(<r + 1) V is isotropic. Let K be the kernel of the map o- 1 = o- + 1, 
hence the subspace of all vectors of V which are left fixed by or. 
Then (a + 1) V C K. Let us write 

K = dK K Q . 

Since or/C = K , we have <rJ* = X? and thus (<r 1) K% C ^*- 
The only vectors of K which are contained in K* Q are those of rad K 
which means that rad K is the kernel of cr 1 if applied to K% . 
This shows 

dim Kl = dim (a- - 1)X? + dim rad K. 

Both (0- !)/* and rad K are isotropic subspaces of the non- 
singular space K* Q and have, therefore, dimensions at most J dim K% . 
We see that this value is reached, i.e., that rad K must be a maximal 
isotropic subspace of K* Q . If K l denotes a maximal isotropic subspace 
of K , then K 2 = rad K _L K l is a maximal isotropic subspace of V. 
But K 2 C K shows that every vector of K 2 is left fixed by cr. 

DEFINITION 3.10. Suppose the characteristic of k is j& 2. If 
er= luJLlw and p = dim U, then we call p the type of the 
involution a. We have obviously det cr = ( l) p . Since C7 has to be 
non-singular the type p must be an even number if V is symplectic; 
if V has orthogonal geometry, then p might be any number < n = 
dim V. An involution of type p = 1 shall be called a symmetry with 
respect to the hypcrplane TV. An involution of type 2 shall be called 
a 180 rotation. 

The reason for the word symmetry should be clear: U = (A) 
is a non-singular line, 17* = W a non-singular hyperplane and the 
image of a vector xA + B (with B z W) is xA + B. 

The following theorem characterizes the isometrics 1? 

THEOREM 3.15. Let V be non-singular and a an isometry of V 
which keeps all lines of V fixed. Then a = lr . 
Proof: If o- keeps the line (X) fixed, then aX = Xa and for any 
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Y t (X) we have <rY = <r(Xb) = <r(X) -b = Xab = Fa. This a may 
still depend on the line (X) if o- keeps every line of V fixed. If (X) 
and (F) are different lines, then X and Y are independent vectors. 
We have on one hand a(X + Y) = (X + Y)>c and on the other 
a(X + F) = cr(X) + o-(F) = Xa + Yb. A comparison shows 
a = c = b and we know now that we have <rX = Xa with the same 
a for all X. Let X and F be vectors such that XY 7* 0. Then JfF = 
(rZ-crF = Xa-Ya = (XF)a 2 . We see a 2 = 1, a = 1. 

4. Special features of orthogonal geometry 

In this section V will stand for a non-singular space of dimension n 
with an orthogonal geometry. 

DEFINITION 3.11. The group of all isometrics of F into V is 
called the orthogonal group of V and is denoted by 0. The subgroup 
of all rotations is denoted by O + and the commutator subgroup of 
by 0. If it is necessary to be more specific in these symbols, then we 
shall put n as a subscript and inclose in a parenthesis any further 
information. For instance O n (/c, /) denotes the orthogonal group 
of a space over k whose dimension is n and whose geometry is based 
on the quadratic form /. + is the kernel of the determinant map; 
the image is commutative, hence 12 C O + . The index of O h in is 2 
since reflexions (for instance symmetries) are present if n > 1. 

We state a slight improvement of Witt's theorem. 

THEOREM 3.16. Let <r be an isometry of a subspace U of V into 
V. It is possible to prescribe the value 1 for the determinant of an 
extension r of a to all of V if and only if dim U + dim rad U < n. 

Proof: 

1) If T! and T 2 are two extensions of a- to V whose determinants 
have opposite sign, then p = rrV 2 will be a reflexion of V which 
keeps every vector of U fixed. Conversely, if p is a reflexion which 
keeps every vector of U fixed and TI one extension of a to F, then 
r 2 = TIP is another extension and det r 2 = det r t . All we have to 
do is to see whether such a reflexion exists. 

2) Suppose dim U + dim rad U < n. Tho space U of Theorem 
3.8 is of dimension dim U + dim rad f/. We can write V = _L 0* 
where U* ^ 0. Let X be a reflexion of U* and put p = !# _L X. 

3) Suppose dim U + dim rad U = n. Then F = = H 2r _L W. 
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We are looking for a p which is identity on U and, therefore, on W. 
It must map W* = fI 2r onto itself and be a reflexion on H 2r . But 
it should keep every vector of rad (/ (which is a maximal isotropic 
subspace of H 2r ) fixed. Theorem 3.13 shows that such a p does not 
exist. 

THEOREM 3.17. Let <r be an isometry of V which leaves every 
vector of a hyperplane H fixed. If H is singular, then a = 1. // H is 
non-singular, then a is either identity or the symmetry with respect 
to II. The effect of an isometry on a hyperplane H determines the 
isometry completely if II is singular, and up to a symmetry with respect 
to II if II is non-singular. 

Proof: 

1) Let H be singular. The line L = //* = (N) is also singular, 
N is isotropic. We have rad H = rad H* = (N) and can write 

// = (N) J_ W 

where W is non-singular and of dimension n 2. The plane W* is 
non-singular and contains N. There exists, therefore, a unique 
hyperbolic pair N, M in W*. If a leaves // element- wise fixed then 
<rW* = W*. The pair <rN, aM is again hyperbolic and <rN = N; 
the uniqueness gives now <rM M which shows a to be identity on 
IF* as well as on W. 

2) Let // be non-singular. The line L = //* = (A) will be non- 
singular and V = L J_ H. If a is identity on H then cr = A J. 1 H 
and X, as isometry on L, must be zfcl/, . 

3) If <r and T have the same effect on H, then <r"V leaves every 
vector of // fixed. If H is singular, then r~ l r = 1, <r = r. If H is 
non-singular, <r~ l r may be the symmetry with respect to //. 

We can improve on Theorem 3.15 in two ways. 

THEOREM 3.18. Suppose that n > 3 and that V contains isotropic 
lines. If or keeps every isotropic line fixed, then a = 1 v 

Proof: V contains a hyperbolic pair N, M. The space (N, M)* 
is non-singular and ^ 0. Let B e (AT, M)*. If B 2 = then *B = Be 
by assumption. Assume now B 2 ^ 0. Then (AT, M) (B) = (N, M, B) 
is non-singular and (B) the only line in (N, M, B) which is orthogonal 
to (N, M). If we can prove that a- maps (N, M, B) onto itself, then 
we obtain again crB = Be since, by assumption, <rN = Na and 
<?M = Mb. 
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Regardless of whether B 2 = or not, the vector N JB 2 M + B 
is easjly seen to be isotropic. By assumption we have, therefore, on 
one hand 



= dN - B 2 -M + dB 

2i 

and on the other hand 

<T(N - I B*-M + BJ = aN - |# 2 -M + a(B). 

A comparison shows first that a(B) t (N, M, B) which proves 
a(B) = Be also if B 2 7* 0, and secondly that a = d = c. If B is selected 
so that B 2 j 0, then one sees in addition that a = d = b. This 
proves now that every vector of (N, M) and every vector of (AT, M}* 
is multiplied by a. Therefore <rX = Xa for all X e V and Theorem 
3.15 shows ff = l v . 

THEOREM 3.19. Suppose that & leaves every non-isotropic line 
of V fixed but omit the single case where V is a hyperbolic plane over 
afield with 3 elements. Then a = dbl F . 

Proof: Because of Theorem 3.15 we may assume that V contains 
a hyperbolic pair N, M. 

1) dim V > 3; then dim(AT, M)* > 1. Let B be a non-isotropic 
vector of (N, M)*. Both B and N + B are non-isotropic, hence, by 
assumption, aB = Bb and <r(N + B) = (N + B)c = <r(N) + Bb] 
therefore a (AT) = Nc + B(c b). But <r(N) must be isotropic; we 
deduce c = 6, 0(N) = Nc and o- keeps also the isotropic lines fixed. 
We use now Theorem 3.15. 

2) V = (AT, M). If the lines (AT), (M) are not loft fixed, then a 
must interchange them since they are the only isotropic lines of F. 
Hence <rN = aM, <*M = WV. From ATM = crAT-crM = aM-bN = 1 
we deduce b = I/a . Let c be distinct from 0, d=a. The vector AT + cM 
is not isotropic, hence, by assumption, 

a(N + cM) = d(N + cM) = dN + dcM 
on one hand and 

a(N + cM) = aM + -N 
a 



CHAPTER III 129 

on the other. A comparison yields a = dc , c/a = d, hence c 3 = a 2 , 
a contradiction. 

In our next theorem, due to E. Cartari and J. Dieudonne*, we 
try to express each isometry of V by symmetries with respect to 
hyperplanes. 

THEOREM 3.20. Let V be non-singular and dim V = n. Every 
isometry of V onto V is a product of at most n symmetries with respect 
to non-singular hyperplanes. 

Proof: The theorem is trivial for (7=1 and also for n = 1. We 
use induction on n and have to distinguish 4 cases. 

Case 1). There exists a non-isotropic vector A, left fixed by <7. 
Let H = (A)*] then <?H = /?. Let X be the restriction of a to H and 
write X = r v r 2 - r r with r < n 1 (by induction hypothesis) 
where r t is a symmetry of the space H with respect to a hyperplane 
Hi of //. Put Ti ; = IL JL Ti where L = (A). Each f t leaves the hyper- 
plane L JL H, of V fixed and is, therefore (as reflexion), a symmetry 
of F. The product f^ f r is \ L _L X = o- and we know in this 
case that <r can be expressed by at most n 1 symmetries. 

Case 2). There exists a non-isotropic vector A such that o-A A 
is no/ isotropic. Let H = (0-A A)* and let r be the symmetry 
with respect to H. Since (<rA + A)(erA - A) = (<rA) 2 - A 2 = 
(o- is an isometry) we have crA + A e //. Therefore 

T(O-A + A) = o-A + A, r(o-A - A) = A - o-A. 

Adding we have: rcr(2A) = 2A which shows that TO- leaves A fixed. 
By case 1) TO- = r^a r r where r < n 1. Since r 2 = 1, we 
obtain, after multiplication by r from the left, o- = rr 1 T 2 r r , a 
product of at most n symmetries. 

Case 3). n = 2. Cases 1) and 2) allow us to assume that V 
contains non-zero isotropic vectors: V = (AT, M) where N, M is 
a hyperbolic pair. 

a) <?N = aM, vM = a'W. Then cr(AT + aAf) = aM + N is a 
fixed non-isotropic vector. We are in Case 1). 

b) crN = aN, aM = a~ l M. We may assume a j 1 since o = 1 
means a- = 1. A = AT + M and <rA - A = (a - l)N + (a' 1 - 1)M 
are not isotropic which brings us back to Case 2). 

Case 4). We can now assume that n > 3, that no non-isotropic 
vector is left fixed, and, finally, that o-A A is isotropic whenever 
A is not isotropic. 
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Let N be an isotropic vector. The space (AT)* has at least dimension 
2 and its radical is the same as that of (N) (i.e., it is (N)). (N)* 
contains a non-isotropic vector A. 

We have A 2 ^ and (A + eAT) 2 = A 2 7* 0. We conclude by our 
assumption that a A. A as well as the vectors 

a(A + *N) - (A + cN) = (<rA - A) + *(<rN - N) 
are isotropic. The square is, therefore, 

2e(trA - A)(<rN - N) + 2 ((rAT - N) 2 = 0. 

This last equation is written down for e = 1 and e = 1 and added. 
We get 2(crAT - AT) 2 = or (<rN - AT) 2 = 0. 

We know, therefore, that crX X will be isotropic whether X is 
isotropic or not. The set W of all these vectors aX X is the image 
of V under the map a- I. It contains only isotropic vectors and is, 
consequently an isotropic subspace of V. A product of any two of its 
vectors will be zero. 

Let now X t V and Y e W*. Consider 

(<rX - X)(*Y - F) = = <rX-<rY - X-<rY - (<rX - X) - Y. 

But <rX X e W, Y t TF* so that the last term is 0. Furthermore, 
aX-vY = XY since <r is an isornetry. Hence 

X(Y - (r7) = 

which is true for all X t V. This means Y 0Y t rad V = or 
F = <?Y. 

We see that every vector of W* is left fixed. We had assumed 
that a non-isotropic vector is not left fixed. This shows that T7* is 
an isotropic subspace. We know dim W < \n and dim W* < \n 
since these spaces are isotropic. From dim W + dim W* = n we 
conclude that the equality sign holds. The space V is, therefore, 
a hyperbolic spare H 2r , n = 2r and IV* a maximal isotropic subspace 
of H 2r . The isometry a leaves every vector of W* fixed , arid Theo- 
rem 3.13 shows o- to be a rotation. 

We can conclude, for instance, that for the space V = II 2r our 
theorem holds at least for reflexions. Let now T be any symmetry 
of V = H 2r . Then ra is a reflexion of H 2r , hence TO- = T^Z r. 
with s < n = 2r, but since TO- is a reflexion s must be odd, hence 
s < 2r 1. We get <r = rr^ 2 r. , a product of s + 1 < 2r = n 
symmetries. 



CHAPTER III 131 

REMARK 1. Let cr = r,r 2 r, where r< is the symmetry with 
respect to the hyperplane 77 t . Let U t = H l n 77 2 C\ C\ II 4 ; . Then 
codim /, + codim 77, M = codim ?/,-+, + codim(/ t + 7/ t4l ). 
Hence 

codim [/"+!< codim f7, + 1. 

This shows codim f/ t < i, especially codim (H L C\ 77 2 H C\ 77.) < s 
or 

dim(77 , n 77 2 n n 77.) > n - s. 

Every vector of H { (^ II 2 C\ O 77, is left fixed by every r, and, 
therefore, by a. This shows that every vector of a subspace of a 
dimension at least n s is left fixed by o-. Should s < n, then a- 
leaves some non-zero vector fixed. We conclude that an isometry 
without a fixed non-zero vector (as for instance 1 F ) can not be 
written as product of less than n symmetries. 

If o- is a product of n symmetries, then det <7 = ( l) n . Thus we 
state: 

If n is even, then every reflexion keeps some non-zero vector 
fixed, and if n is odd, the same holds for a rotation. 

REMARK 2. Suppose that a can not be written by less than n 
symmetries, and let T be an arbitrary symmetry. Then det(ro-) = 
det o-, which means TO- = TiT 2 - T, with s < n and, hence, 
a = TT I T. (s + 1 < n). We see that the first factor in the product 
for (7 can be an arbitrarily chosen symmetry. 

Examples 

I. dim 7 = 2. 

Every reflexion is a symmetry and every rotation is a product 
of two symmetries in which the first factor can be chosen in advance 
(also for 1 = TT). Theorem 3.17 shows that a rotation o- 7* 1 can 
leave only the zero vector fixed and that any rotation is uniquely 
determined by the image of one non-zero vector. 

Let r be a symmetry, a = TTI a rotation. Then rar" 1 = T Z T I T = 
TIT = (rrj)' 1 = <r~ l . We have, therefore 

(3.36) T(TT~ I = (T 1 . 

If <7j = rr 2 is another rotation, we have 

= T(r 2 c7T2 )T~ I = T<r~ r~ = (7. 
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Thus the group 02 is commutative and the interaction with the coset 
of symmetries is given by (3.36). 

Theorem 3.14 tells us that dblr are the only involutions among 
the rotations, if n = 2. 

This also allows us to answer the question when the group 2 
itself is abelian. Should this be the case, then (3.36) shows o- = o-" 1 
or o- 2 = 1, i.e., that there are only two rotations. 0\ has index 2 in 2 , 
rOa is the other coset (r a symmetry) so that there are only two 
symmetries in such a geometry. This means that V contains only 
two non-isotropic lines. If A, B is an orthogonal basis, then (A) 
and (B) are the only non-isotropic lines. Hence the two lines (A + B) 
and (A B) are isotropic and, since V can only contain two isotropic 
lines, we have enumerated all lines of F. The field k has only three 
elements. Conversely, if F is a hyperbolic plane over a field with 
three elements, then V contains four lines, two isotropic and two non- 
isotropic ones. There are two symmetries and thus two rotations. 
These rotations can only be dbly and we have v = a' 1 for every 
rotation; 2 is commutative. To summarize: 

If dim V = 2, but if V is not a hyperbolic plane over a field with 
three elements, then 2 is not commutative and 2 contains rotations 
<r for which a 2 7* 1. 
II. dim 7 = 3. 

Theorem 3.17 shows that a rotation 5* 1 can not leave two inde- 
pendent vectors fixed. However it is a product of two symmetries 
and leaves, therefore, all vectors of some line (A) fixed. We call 
(A) the axis of the rotation. 

Let ff be an isometry which keeps one non-zero vector A fixed 
and reverses some other non-zero vector B. The vectors A, B are 
independent, o- 2 will be a rotation which keeps every vector of 
(A, B) fixed. By Theorem 3.17 we have o- 2 = 1 (but a ^ dbl). It 
follows that cr is an involution 7* 1, hence a symmetry, if it is a 
reflexion, and a 180 rotation, if it is a rotation. 

If a rotation o- is written as product Tir 2 of symmetries, we can 
also write a- = ( r 1 )( r 2 ) as product of two 180 rotations. 

Let us look finally at all rotations with given axis (A) (include 
also <r = 1). They form obviously a subgroup of 0\ . We have to 
distinguish two cases. 

1) (A) non-isotropic. We can write V = (A) (A)* and our 
group is obviously isomorphic to the group 2 of (A)*. 
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2) A = AT, N 2 = 0. We have rad (N)* = rad (N} = (N). Hence 
(N)* = (AT, J5) with NB = 0, 2 5* 0. We shall determine all isom- 
etries a of F for which 0N = 2V. Then <r(N)* = (2V)* and <r is de- 
scribed completely by the effect it has on the singular plane (N)* 
(Theorem 3.17). We set <rB = xN + yB and from B* = (er B) 2 = 
ifB 2 get y = 1. 

Let r x be the symmetry with respect to the plane perpendicular 
to %xN - B. Since N-(%xN - B) = we have r x AT = AT; r x B = 
rXIzAT - (%xN - J5)) = %xN + %xN - JB = xN - J5. For x = 
we have r 2V = TV, T O # = B. If we denote r r z by cr,. , then or x is a 
rotation and 



The symmetries T X and the rotations cr z are now all isometrics 
which keep N fixed. Obviously 

<r x cr v = <r x + v 

which shows that the rotations with axis (N) form a group isomorphic 
to the additive group of 7c. 

We remark that ar x = (o- x/2 ) 2 which shows that every <r x is the 
square of a rotation. 

Let us look also at reflexions. If a is a reflexion, then o- is a 
rotation. Should v 7* 1, then cr will have an axis (A). This 
means that a reverses the vectors of precisely one line (A). Should 
cr keep some non-zero vector fixed, then o- is a symmetry, as we 
have seen earlier. The reflexions without fixed vectors are exactly 
those that can not be written as product of less than three symmetries. 

Some of these results are useful if n > 3 . 

THEOREM 3.21. Let dim V > 3. The centralizer of the set of 
squares of rotations consists of 1^ . 

Proof: 

1) Suppose V contains an isotropic line (N). Then rad (N)* = 
rad (N) = (N) and we can write 

W 



where W is non-singular and of dimension n 2 > 1. Let U be 
a non-singular subspace of W of dimension n 3 (possibly U = 0) 
and write V = U* _L U. This space U* is non-singular, of dimension 
3, and contains (N). Let cr x be a rotation of U* with axis (N); <r m is 
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the square of the rotation <r x/2 . If we put a = (r x/2 J. l v , then 
(r 2 = IT, JL It, . Which vectors of V are left fixed by o- 2 ? Let A t [/*, 
B t U and suppose A + B is left fixed by <r x J_ lu Obviously A must 
be left fixed by v s and this means A t (N). The fixed vectors are, 
therefore, those of (N) _L U = W Q , a subspace whose radical is (N). 
Let now r be commutative with all squares of rotations. Then 
ro-V" 1 = <r 2 ; rcrV" 1 leaves every vector of rW fixed so that rW Q = W . 
This means that the radical r(N} of rW is (AT). Our r leaves every 
isotropic line (N) fixed and Theorem 3.19 shows r = lr . 

2) Suppose V does not contain isotropic vectors. Let P be any 
plane of V. We know that there exists a rotation X of P such 
that X 2 j* 1. Write 7 = P P* and let a = X 1 P . . Then a 2 
leaves only the vectors of P* fixed. If T<r 2 r~ l = o- 2 , then we see as 
before that rP* = P* and consequently rP = P. The element T 
leaves every plane fixed. A given line is an intersection of two planes 
(since n > 3) and we see that r leaves every line of V fixed. From 
Theorem 3.15 we see now r = 1^ . 

The meaning of Theorem 3.21 will become clear a little later. 

Let now TI and r 2 be symmetries with respect to the non-singular 
hyperplanes H v and H 2 . Assume again n > 3. Let (A) = H\ and 
(B) = H\ . (A) and (B) are non-singular, the space P = (A, B) 
(of dimension < 2) not isotropic, its radical of dimension < 1. 
We have again rad P* = rad P, P* has dimension at least n 2 and 
contains a non-singular subspace U of dimension n 3. The space 
U C H i P\ 7/2 , since U is orthogonal to both A and B. TI and r 2 
are identity on U and symmetries f l , f 2 on the three dimensional 
non-singular space C7*. We can write 

r l r 2 = ( ^ JL Itf)(r 2 J_ 1^) 

and have replaced r v r 2 by a product of two 180 rotations. 

Let now a- = r,r 2 r. be any rotation of F, expressed as product 
of s < n symmetries. Since s must be even, we can group the product 
in pairs and replace them by 180 rotations. Thus we have 

THEOREM 3.22. // n > 3, then every rotation is a product of an 
even number < n of 180 rotations. 
For our next aim we need a group theoretical lemma. 

LEMMA. Let G be a group and S a subset of elements with the 
following properties: 
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1) G is generated by 8. 

2) For x e G and s e 8 we have xsx~~ l e S. 

3) s* = 1 f r all s e S. 

Then the commutator subgroup G' of G is generated by the commutators 
818981* 83* = (siS 2 ) 2 , Si e *S, 5 2 $ G' contains the squares of all elements 
ofG. 

Proof: Let ff be the set of all products of elements of the form 
SiS^s^s^ 1 with s lt s 2 e S. H is obviously a group and property 2) 
shows that it is invariant in G. 

Let / be the canonical map G > G/// with kernel ff. Since 
s^srW 1 e H wehave/^Sz^r 1 ^ 1 ) = 1, hence /(si)/( 2 ) = /(^/(siX 
The set /S generates G so that the image /(G) = G/# is generated 
by f(S). We see that /(G) is a commutative group which shows 
that G 1 is contained in the kernel H. Trivially, since H is generated 
by commutators, H C G'. Every generator of /(G) has at most 
order 2 so that every element in /(G) has at most order 2. This 
proves f(x 2 ) = 1, x 2 c H. 

If o- is a symmetry (180 rotation) with respect to the hyperplane 
H (with the plane P) then rcrr" 1 is again a symmetry (180 rotation) 
with hyperplane rH (with plane rP) and o- 2 = 1. The set of all 
symmetries (180 rotations) generates the group (the group 0*) 
and satisfies the conditions of the lemma. 

For n > 3 the commutator group of 0^ is generated by all (o^oa) 1 , 
where <r 1 and o- 2 are 180 rotations, and it contains all squares of 
rotations. The commutator group ft n of O n is generated by all (r^a) 2 
where TI and r 2 are symmetries; since r^ is a rotation, (r^a) 2 is a 
square of a rotation. This shows that 12 n is not larger than the com- 
mutator group of On . The meaning of Theorem 3.21 becomes clear. 
We have determined the centralizer of the group Q n , therefore, also 
the centralizer of + n and O n (since l r obviously commute with 
every element of O n ). Thus we have 

THEOREM 3.23. Let n > 3. The groups O n and Ot have the same 
commutator group 12 n which is generated by all (r^) 2 where TI and r a 
are symmetries and contains all squares of elements of O n . The cen- 
tralizer of O n , Ot or O n is dblp . The center of O n is l r .Ifn is odd, 
then l r is the centet of 0* n and ofto n ( l v is a reflexion). If n is even t 
then ifcl F is the center of On ; the center of ti n is l v or l v depending 
on whether l v e Q n or not. In the factor groups O n /8n and 0* n /Q H all 
elements have at most order 2. 
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Another important property of the group 12 n is expressed in 

THEOREM 3.24. // n > 3, then 12 n is irreducible. This means 
that no proper subspace U 7* is mapped into itself by all elements 



Proof: Suppose that there is such a proper subspace U ^ 0. 
We shall derive a contradiction and distinguish two cases. 

1) U is non-singular. Write V = U JL W\ if o- e Q n , then, by 
assumption, crU = U and hence <r = T _L p with r e 0(C7), p c 0(W). 
The involution X = lu JL lw would satisfy o-X = Xer and this 
implies that X is in the centralizer of 12 n , which is a contradiction to 
Theorem 3.23. 

2) U is singular. Then rad U 7* is also mapped into itself by & n 
and, replacing U by rad U, we may assume that U is isotropic. 
Let N j be a vector of U and AT, M a hyperbolic pair. Since 
n > 3 we can find a three-dimensional non-singular subspace W = 
(N, M, A) where A is a non-isotropic vector of (AT, M}*. Isotropic 
subspaces of W have at most dimension 1 so that U P\ W = (N) 
(remember that U is isotropic). Let a x ^ 1 be a rotation of W with 
axis (M). If <r t would keep the line (N) fixed, then it would induce 
a rotation of the plane (N, M); but <r x M = M would then imply 
<r x N = N and consequently <r x = l w (two vectors are kept fixed). 
The line (N) is, therefore, moved into another line of W t hence 
into a line not contained in U. Setting <r = <r x _L 1 w we have o- e 12 n 
since o-, = (o- x/2 ) 2 , and 0-./V ^ U. 

EXERCISES. 

1) Investigate the irreducibility of the groups O n , + n , 12 n if n = 2. 

2) Let V be the corresponding project! ve space. Consider the 
hypersurface of V defined by X 2 = 0. What is the geometric meaning 
of the maximal isotropic subspaces of Vt 

5. Special features of symplectic geometry 

We have seen in Theorem 3.7 that every non-singular symplectic 
space is hyperbolic: 

V = H 2r = P l JL P 2 JL - - JL P r 
(3.37) 

= (N, , M, , N 2 , M 2 , - , N r , M r ) 
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where each P, = (N< , M t -) is a hyperbolic plane. The N t , M , satisfy 
(3.38) #,#, = M,M, = 0, NMi = 1, #^, = for i/ 5^ /x. 

DEFINITION 3.12. We call a basis of 7 of the type (3.37), (3.38) 
and in the arrangement (3.37) a symplectic basis of V. The dis- 
criminant G of a symplectic basis is 1 as one easily computes. 

DEFINITION 3.13. The group of isometrics of a non-singular 
symplectic space V = H 2r of dimension n = 2r is called the symplectic 
group and denoted by Sp n (k). 

We are going to show that Sp n (k) contains only rotations and shall 
give two proofs for this fact. 

Let A be a given non-zero vector of V. We ask whether Sp n (k) 
contains an element 0- which moves each vector X e V by some vector 
of the line (A): 



= X + <p(X)-A, <p(X) e k. 
We should have cr(X) -<r(Y) = XY. This leads to 
(3.39) 



Select Fo so that (AY ) ^ 0, substitute Y = Y Q in (3.39) and com- 
pute <p(X). We obtain 

(3.40) <p(X) = c-(AX) 

where c e k is a constant; (3.40) certainly satisfies (3.39) and con- 
sequently 

(3.41) a(X) = X + c-(AX)-A. 

We verify immediately that any map of the form (3.41) is a homo- 
morphism of V into V. Since V is non-singular, a is indeed an iso- 
metry. If c = 0, then o- = l v . If c j 0, then a leaves X fixed if 
and only if AX = 0, which means that X must belong to the hyper- 
plane // = (A)*. We can characterize these special isometrics in 
another way. Let H be a given hyperplane. When does a leave every 
vector of H fixed? If X t V and Y e H, then 



= <rX-0 = 
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since we suppose <rY = Y. This means crX X e H*. But //* is a line 
and we 'are, therefore, back at our original question. 

DEFINITION 3.14. An isomctry of the type (3.41) shall be called 
a symplectic transvection in the direction A. 

For a fixed A our a still depends on c. If we denote it by o- c we 
find <r e -cr d = <r e+d and a c = l v if and only if c = 0. This means 
that the symplectic transvections in the direction A form a com- 
mutative group, isomorphic to the additive group of k. 

Let A and B be two non-zero vectors of V. When can we find a 
symplectic transvection which moves A into J5? The direction 
would have to be B A . Since the case B = A is trivial, we may 
assume B ^ A. Any vector orthogonal to B A is not moved at all 
and since A is to be changed we must have A(B - A) = AB 7* 0. If 
this is true, then A is moved by a multiple of B A. Since we 
still have the c at our disposal we can indeed achieve <r(A) = B. 
To repeat: A can be moved into B either if A and B are not ortho- 
gonal, or if A = B. 

Suppose now that AB = 0. We can find a C such that AC ^ 
and EC ^ 0. Indeed, if (A)* = (B)*, select C outside the hyperplane 
(A)*. If (A)* * <>* let D t (A)*, D * (B)* and E e <B>*, E j (A)*; 
for C = D + E we have AC = AE ^ and EG = BD 9* 0. Now 
we can move A into C and then C into B. A non-zero vector A can 
be moved into some other non-zero vector B by at most two sym- 
plectic transvections. 

Let now N l , M i and N 2 , M 2 be two hyperbolic pairs. By at 
most two transvections we can move NI into N 2 and these trans- 
vections will move the hyperbolic pair NI , M l into a pair N 2 , M< 3 . 
We contend that by at most two more transvections we can move 
N 2 j M 3 into N 2 , M 2 and thus have moved NI , M l into N 2 , M 2 by 
at most four symplectic transvections. What we still have to show 
amounts to the following: if N, M and N, M' are hyperbolic pairs, 
then we can find at most two transvections which keep N fixed 
and move M into M '. 

Case 1). MM' ^ 0. Then we can move M into M' by one trans- 
vection whose direction is M ' M. Notice that N(M f M) = 
NM' NM = 1 1 = 0; AT remains fixed under this transvection 
and we are done. 

Case 2). MM 1 = 0. We first see that N, N + M is also a hyper- 
bolic pair and M(N + M ) = MN =1^0. Therefore we can 
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move AT, M into AT, N + M. Now we have (N + M)M' = NM' = 
1 5^ and can, therefore, move N, N + M into AT, M'. This finishes 
the proof. 

Let U be a non-singular subspace of V so that 

V = U J. 7*. 

If r is a transvection of [/*, then 1^ _L r is obviously a transvection 
of F. This shows: 

If NI , M l and N 2 , M 2 are hyperbolic pairs of [/*, then we can 
move 2V, , M! into AT 2 , M 2 by at most four transvections of V such 
that each of these transvections keeps every vector of U fixed. 

Let now G be the subgroup of Sp n (k) which is generated by all 
symplectic transvections. Let a e Sp n (k) and A^ , M t , AT 2 , M 2 , 
, N r , M r be a symplectic basis of V. Denote the images of this 
basis under a by N[ , M{ , N 2 , M 2 , , N' r , M' r . There is an element 
TI e G which moves NI , M l into N( , M{ . Suppose we have con- 
structed an element r t e G which moves the first i of our hyperbolic 
pairs AT, , M 9 into the corresponding N' v , M' 9 . The images of our first 
symplectic basis under r, will form a symplectic basis 

NI , M{ , NI, M' 2J - - , AT: , M: , tf ," + i , Affi! , - , AT r ", M r ". 

Let C7 = (2V( , M( , N' 2 , M 2 , - , ATJ , M(), then the pairs A r J + , , M' i+ , 
and ATJi, , M(\ v belong all to U* (v > 1). Find a r e G which keeps 
every vector of U fixed and moves the pair N"+i , Af.'ii into the 
pair N(+i , Af J+i . The element TT { will move the first i + 1 pairs 
Af, , M, into the corresponding pairs N' v , AfJ . This shows that there 
is an element of G which moves the whole first basis into the second 
and we obtain G = Sp n (k). 

Now we will see that det a = +1 for all elements of Sp n (k). It 
is indeed enough to show this if a is a transvection. Nothing is to 
be proved if the characteristic of k is 2; for, in this case 1 = +1. 
If it is not 2 we can write <r c = <r 2 c/2 and our contention follows. 

THEOREM 3.25. Every element of Sp n (k) is a rotation (i.e., has 
determinant +1). The group Sp n (k) is generated by the symplectic 
transvections. 

If o- is a transvection ^ 1 with direction A, then <rX X e (A) 
for all X e V. If r t Sp n (k), then 

T*T~ 1 X - X = r(a(r"X) - (r' l X)) e (rA) 
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which shows that ro-r" 1 5^ 1 is a transvection with direction rA. 
Suppose now that r belongs to the center of Sp n (k). Then certainly 
T<TT~ I = cr for our transvection and, consequently, (A) = (rA). 
The element r leaves all lines of V fixed and Theorem 3.14 implies 
T = dbl r . 

THEOREM 3.26. The center of Sp n (k) consists of lr . It is of 
order 2 if the characteristic of k is 5^ 2, of order 1 if the characteristic is 2. 

The second proof proceeds in an entirely different manner. Let 
V be a symplectic space, possibly singular. Take any basis of V 
and compute its discriminant G. If V is singular, then G = 0; if 
V is non-singular, then V has a symplectic basis whose discriminant 
is +1. By formula (3.7) we see that G is the square of an element of k. 

Lew now (</,,) be any skew symmetric matrix. It induces on a 
vector space with given basis A,- a symplectic geometry if we define 
Ai : Aj = Qa . It follows that the determinant of any skew symmetric 
matrix with elements g ti e k is the square of an element of k. Should 
n be odd, then the symplectic space is singular, the determinant is 
zero. We shall assume from now on that n is even. 

Let Q be the field of rational numbers. Adjoin to Q the n(n l)/2 
independent variables #,-, (1 < i < 3 < n ) an( l call k the resulting 
field of rational functions of the #,,- with rational coefficients. Define 
XH to be and put x ti = x it for i > j. The matrix (#,,-) is skew 
symmetric. Hence 

(3.42) det(*,.,) 

where / and g are polynomials in the #,/ (i < j) with integral co- 
efficients. The reader will be familiar with the fact that one has 
unique factorisation in the ring of polynomials with integral co- 
efficients. We may assume that g is relatively prime to /. The left 
side of (3.42) is a polynomial with integral coefficients; it follows 
that f 2 is divisible by g 2 which implies that g is a unit, g = 1. 
The identity (3.42) may now be simplified to 

det(*, f ) = f 2 

and / is determined up to a sign. This is a polynomial identity 
which must remain true in any field and for any (skew symmetric) 
special values for the z t , . If we specialize the x ti to the g it of a 
symplectic basis, then the left side becomes 1. We may, therefore, 
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fix the sign of / by demanding that / should take on the value 1 
for a symplectic basis. 

THEOREM 3.27. There exists a polynomial called the Pfaffian and 
denoted by Pf(x ii ), which has integral coefficients and the following 
property: If (g if ) is skew symmetric, then 

detfo,,) = (P/W- 

// the g {i come from a symplectic basis, then Pf(ga) = 1. 

Adjoin now to Q not only the x ti but n 2 more variables a t/ and put 



It is easily seen that (?/,,-) is skew symmetric. Taking determinants 
on both sides and extracting a square root we find 

JY(tf) = -P/(x,,)-det(a,,) 

where e = 1. To determine e specialize the (a tj -) to the unit matrix 
when y ti = x {i . This shows = +1. 

THEOREM 3.28. // (g ti ) is skew symmetric and (a,,-) any matrix, 
then 



Suppose that (a,/) is a matrix describing an element of Sp n (k) 
and suppose P/(gf,-,) ^ which means that the symplectic space is 
non-singular. Then (a/ t )(5r t/ )(O = (jr<,) and we get det(a t ,-) = +1, 
another proof of Theorem 3.25. 

If V is a non-singular symplectic space of dimension n and 
(</,,) any skew symmetric matrix, we can always find vectors 
AI , A 2 , , A n 'mV (not necessarily a basis) such that g ti = A+Aj . 
This may be proved as follows: Let F be a space with basis 
B! , B 2 , , B n . Impose on V a symplectic geometry by putting 
BfBf = g ti . This space V may be singular, 

7 = V l JL rad 7 . 

Let B t = C t + D t where C< e F t , D, e rad 7 . Then g it = B.JS,- = 
C.C/ . We have g {i = C t C,- where the vectors C t - come from the 
non-singular space V l . Imbed now Vi in a non-singular space V 
of the right dimension. 
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We may take in V a symplectic basis 
7,,- = E.E, (then P/(y,,) = 1) and set 



, E n , put 



Then 



or 



Theorem 3.28 shows 
(3.43) 



= det(a,,). 



From (3.43) one can get all the properties of the Pfaffian which 
we formulate as exercises. 

1) If one interchanges the r-th and s-th row in (0,,) and simul- 
taneously the r-th and s-th column, then P/(</,) changes the sign. 

2) If one multiplies the r-th row and the r-th column of (</,,) 
by /, then P/(<7,-/) takes on the factor t. 

3) Pf(x t j) is linear in the row of variables x ri (r fixed, i = 1, 2, 
, n). Denote by C r , the factor of x rt in Pf(x lt ) (r < s). 



4) C 12 = 
and more generally 

c r . = p/((*,.,),..,v,..:K-ir- 1 , r < s . 

5) Prove, for instance, the expansion 

Pffra) = Si2C 12 + Xi*C l3 + - + x ln C ln . 

6) For n = 2, 
for n = 4, 

7) If (a,,-) is any matrix of even degree with column vectors 
Ai and if one defines a "symplectic product" of column vectors by 



a s / a 4l 

then det(a tj ) can be computed by means of 

det(a,,) = 
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6. Geometry over finite fields 

Let k = F q be a field with q elements, F* q the multiplicative 
group of non-zero elements of F q . Squaring the elements of F* is 
a homomorphism whose kernel is 1. If the characteristic is 2, 
then +1 = 1, the map is an isomorphism, every element is a 
square. If the characteristic is 5^ 2, then the kernel is of order 2, 
the image group has order (q l)/2 and has index 2 in the whole 
group F* . If g is a special non-square, then every non-square is of 
the form gy 2 . 

Let V be a vector space of dimension n over F q with a non-singular 
orthogonal geometry. We investigate first the lowest values of n. 

1) n = 1, V = (A}. A 2 may have the form a 2 or the form go, 2 ; 
changing A by the factor a we may assume that either A 2 = 1 or 
A 2 = g. This gives two distinct geometries: If A 2 = 1, then the 
squares of the vectors j of V are squares in F* ; if A 2 = g, then 
they are non-squares of F* . 

2) n = 2. A plane over F q contains q + 1 lines, (A + xB) and 
(B), if V = (A, B). Attach to V a sign e = 1; if V is hyperbolic 
put = +1, if V contains no isotropic line put = 1. The number 
of non-isotropic lines of V is q 6 (there are just 2 isotropic lines if 
e = +1). The number of symmetries in 0(V) is, therefore, q c. 
These symmetries form the coset of + (V) in 0(7) which is ^ + (V). 
It follows that 0*(V) contains also q c elements. 

Each of the g c non-isotropic lines contains q 1 non-zero 
vectors. The total number of non-isotropic vectors in V is, therefore, 
(<7 1)(<7 ). The q e rotations produce q 1 equivalence 
classes among the non-isotropic vectors of V. An equivalence class 
consists (by Witt's theorem) precisely of the vectors with equal 
squares. We see that every element of F* q must appear as square of 
some vector of V. 

Let A be a vector of V with A 2 = 1, B a non-zero vector orthogonal 
to A; changing B by a factor we may assume that either B 2 = 1 
or B 2 = -g. Since V = (A, B) and (Ax + By) 2 = x 2 + B 2 y 2 we 
see that V is hyperbolic if B 2 = 1 and not hyperbolic if B 2 = g. 
The sign really describes the geometry completely; if c = 1, 
then x 2 gy 2 is the quadratic form attached to our geometry. 

For the total number of isotropic vectors in V we find q + eq 
(2q 1 if V is hyperbolic, only the zero vector if V is not hyperbolic). 
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3) n > 3. Let P be any non-singular plane of V and B a non- 
isotropic vector of P*. We can find in P a vector C such that 
C 2 = -B 2 . Then C + B ^ but (C + Bf = 0. V must contain 
isotropic vectors. 

According to Theorem 3.11 we can write 

V = H 2r _L W 

where H 2r is hyperbolic and where W does not contain non-zero 
isotropic vectors. We see that we have four possible types of geo- 
metries (P,- = hyperbolic planes): 

n odd 

(I) V = P, _L P 2 J. - J. P (n - 1)/2 1 (A), A 2 = 1, 

(II) V = Px JL P 2 J- - - J_ P (n - 1)/2 <A>, A 2 = 0, 
n even 

(III) 7 = P, _LP a P n/2 , 

(IV) P = Pi J. P, JL J- PC.-,,,, -L W, 

where TF is a plane with c = 1. 

The quadratic forms attached to these four types are: 



(I) 

(II) 2x,x 2 + 2x 3 x 4 + - - + 2x n _ 2 x n _ 1 + gxl , 

(III) 2x t x 2 + 2x 3 x 4 + + 2x n _ 3 ^- 2 + 2x n _!X n , 

(IV) 2XiX 2 + 2x 3 x 4 + + 2x n _ 3 x_ 2 + xl,, - gxl 



The discriminants are (-J) (n - 1}/2 , (-l) (n " u/ V, (-l) n/2 , (-l) n/ V It 
is worth while noticing that n together with the quadratic character 
of the discriminant determines the type of geometry. It will also 
be important to remember that if V = PI _L Vi , where PI is a 
hyperbolic plane, then V and V l are of the same type. 

The difference between type I and type II is inessential. If we 
multiply the quadratic form of type I by g it becomes equivalent to 
the form of type II. The difference between type III and type IV is 
very great; a maximal isotropic subspace is of dimension n/2 for 
type III and of dimension n/2 1 for type IV. 
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We associate again a sign e = +1 with type III and e = I 
with type IV. 

Denote now by <p n the number of isotropic vectors of V for some 
fixed type of geometry. If n > 3 and N, M is a hyperbolic pair of V, 
then (A/", M)* has dimension n 2 and is of the same type as V; 
(N, M)* contains, therefore, ^ n _ 2 isotropic vectors. 

The space (A^) J_ (N, M}* has dimension n 1 and is orthogonal 
to AT; hence it is the space (N)*. Let zAT + A be a vector in this space 
where A t (AT, M}*. It will be isotropic if and only if A is isotropic. 
Thus (N)* contains qv> n _ 2 isotropic vectors. To get <p n we must 
also find out how many isotropic vectors are not orthogonal to N. 
Such a vector will span a hyperbolic plane with N. A hyperbolic 
plane containing N is obtained in the form (N, B) provided NB j 0. 
There are q n vectors in V and (f~ l of them are in (N)* so that we have 
q n - q n ~ l = q *~ l (q - i) suc h vectors B. Each plane (N, B) will 
contain, by the same reasoning, q 2 q = q(q 1) vectors C (not 
orthogonal to AT) which together with N span the same plane (AT, J3). 
We obtain, therefore q(q 1) times the same plane if we let B range 
over the q n ~ 1 (q 1) vectors. Consequently we get q n ~ 2 different 
hyperbolic planes which contain the given N. Each such plane 
contains exactly one hyperbolic pair AT, M whose first vector is the 
given N. It will contain the isotropic vectors xM (with x ^ 0) 
which are not orthogonal to N] there are q 1 of them in the plane. 
It follows that there are q n ~ 2 (q 1) = q n ~ l q n ~ 2 isotropic vectors 
not orthogonal to N. To summarize: 

q<p n -2 vectors are isotropic and orthogonal to AT, 

q n ~ l q n ~ 2 vectors are isotropic and not orthogonal to AT, 

q n ~ 2 is the number of hyperbolic pairs with first component N. 

We obtain for <p n , if n > 3, 



or 

<p n - <r l = q(* n -2 - <r 3 ) 

or 

-n/2/ ~*~1\ -,-(n-2)/2/ ^ n ~^\ 

<1 (<Pn - q ) = q (<f>n-2 q ) 

This means that q~~ n/2 (<p n g"' 1 ) has the same value c for all 
n > 1 (and for each type of geometry), in other words, 

*n= <T' +cq n/2 , n>l. 
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For type I or II, n is odd and <?i = 1 (0 is the only isotropic vector 
if n =1). This shows that c = 0, hence 



. For type III or IV, we get for n = 2: <p 2 = q + eg so that 
c = e /g. Therefore, for even n, 



There are <p n 1 isotropic vectors TV 5^ 0. With each of them there 
are associated q n ~ 2 hyperbolic pairs whose first component is N. 
Thus there are \ n = q n ~ 2 (<p n 1) hyperbolic pairs in V. We have 

\ n = q n ~\q n ' 1 - 1) if n is odd, 

X n = q*-\q n/ * - eXg 0172 '- 1 + c) if n is even. 

Before we go on let us figure out the number X n of hyperbolic 
pairs in a symplectic case; n must then be even but k may have any 
characteristic. 

Every vector N is isotropic so there are q n 1 non-zero isotropic 
vectors. As before, there are q n q n ~ l vectors B not orthogonal to 
N and q 2 q of them span the same hyperbolic plane (N, B). This 
gives q n ~ 2 hyperbolic planes (N, B). In each plane we have q 2 q 
vectors B not orthogonal to N and these B give lines (JS); q 1 
among the B give the same line. (N, B) contains q lines not orthogonal 
to N. On each such line one can find one M such that NM = 1. 
We have, therefore, q n ~ l hyperbolic pairs with first component N. 
Since there are q n 1 such N we have 



if V is symplectic. 

Denote now by $ n either the order of Ot for each of our types, 
or the order of Sp n (k). A given hyperbolic pair AT, M can be moved 
by a or into any of the X n other hyperbolic pairs. If a and r move it 
into the same pair, then r~ l cr = p will leave N, M fixed. Writing 
V = (N, M) (AT, M)* we see that p=*lu?u* where U = (AT, M) 
and where pu* is one of the elements of the group of (AT, M)* whose 
order is 3> n _2 Therefore 
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where n > 3 in the orthogonal case, n > 2 in the symplectic case 
(we needed the existence of a hyperbolic pair). 
If n is odd, then 4>j = 1 (identity the only rotation) hence 



(n-U/2 
t-1 

If n is even and V orthogonal, then < 2 was determined to q 
and we have 

$ = X n X n _ 2 X 4 (? c) 

(n/2 \ 

n (* - )('-' + ))(? - > 
t-2 / 

(n-2)/2 
t-1 

If V is symplectic we have $ = 1 and get 

n/2 
t-1 

The order <f> n of our group is, therefore, given by 

g(-nv4 B j-j ^2 _ ^ jj ^ Q^ 7 orthogonal 

t-i 

(n-2)/2 

g n(n - 2)/ V /2 - ) II (<Z 2t - 1) if n even, 7 orthogonal 

n/2 

gf (B/2)t . J~| (g a * 1) if nev en, V symplectic. 

Remember that < n denotes the order of Ot in the orthogonal 
case and not the order of O n . 

Consider again the case of an orthogonal geometry. Let U be 
a space of dimension r < n with a given non-singular geometry. 
Let IF be a space of dimension n r; giving to W the two geometries 
possible for dimension n r and forming U _L W = V we get two 
possible geometries on V. These geometries are not the same because 
of Theorem 3.12, since W = U*. This shows that the geometry of W 
can be selected (uniquely) in such a way that V receives a given 
one of the two possible geometries. In other words: If V is a given 
non-singular space of dimension n, then all types of geometries 
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occur among the non-singular proper subspaces of V. One may ask 
how many subspaces U exist with a given non-singular geometry. 
Let U be one of them and write 

V = U _L U*. 

The discriminants of V and U will determine that of U* and 
hence the geometry of U*. If we apply all elements of + (F) to 
U we will get all isometric subspaces. One has only to find how 
many elements o- of + (V) map U onto itself. Then U* will also be 
mapped onto itself, hence <r = r JL p where r e 0(C7), p e 0(f/*). 
Since a- has to be a rotation det r = det p. If we denote by $(F) the 
order of + (F) we get $(E7)$(C7*) possibilities where T and p are 
rotations and just as many where r and p are reflexions. This shows 
that there are 



$([/*) 

subspaces of F, isomorphic to f/. Similar questions can be discussed 
for isotropic subspaces or for given sets of independent vectors 
( < n in number) which we leave to the reader. 

7. Geometry over ordered fields Sylvester's theorem 

Suppose that k is an ordered field and V a vector space over k 
with an orthogonal geometry. 

DEFINITION 3.15. The geometry of V is called 

a) positive semi-definite if X 2 > for all .Y c K, 

b) negative semi-definite if X 2 < for all X e V, 

c) positive definite if X 2 > for all X ^ iu K, 

d) negative definite if X 2 < for all X ^ in V. 

Suppose V is positive semi-definite and let Y e V be such that 
y 2 = 0. For any X e V and a e k we have 

(X + aF) 2 = Z 2 + a(XF) > 0. 

If XY were /^ 0, then X 2 + a(XY) would range over all of k as a 
ranges over k and would take on negative values; therefore XY = 
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for all X e V. Hence the radical of V consists of all Y e V with Y 2 = 0; 
a positive semi-definite space is non-singular if and only if it is 
positive definite. The same conclusions hold for negative semi- 
definite spaces. Observe that the 0-space is positive definite as well 
as negative definite. 

Assume now merely that V is non-singular and let U be a positive 
definite subspace, W a negative definite subspace. For X z U C\W 
we conclude X 2 > 0, X 2 < 0, hence X = 0, U C\ W = 0. Therefore 
dim(C7 + W ) = dim U + dim W and, since dim(J7 + W) < n, 
we get dim U + dim W < n. 

Suppose now that U is not properly contained in any positive 
definite subspace of V (U is maximal positive definite). We have 
V = U _L U*. If U* would contain a vector Y with Y 2 > 0, then 
U JL (Y) would clearly be positive definite; therefore Y 2 < for 
all Y e U*. This means, since U* is non-singular, that U* is negative 
definite. We call r the dimension of this particular U. Then we know: 

a) Whenever W is negative definite, then dim W < n r. 

b) The value n r is reached, namely for W = U*. 

This is an invariant characterisation of the number r and we 
conclude that all maximal positive definite subspaces have the 
same dimension r. 

We just need one more piece of information. Suppose U is positive 
definite. When will it be maximal positive definite? We certainly 
must have U* negative definite. Is this sufficient? If C7* is negative 
definite of dimension n s, then any positive definite space has 
dimension < n (n s) = s. Thus U (which has dimension s) 
is maximal positive definite (and of course s = r). 

It is easy to interpret these results in terms of an orthogonal 
basis AI , A 2 , , A n of V. Assume A l , A 2 , , A r are those 
basis vectors whose squares are > and let U = (A l , A 2 , , A r ) 
(U = if r = 0). Then C7* = (A rlrl , A rf2 , - - - , A n ) is negative 
definite and U, therefore, maximal positive definite. Thus the 
number r is our previous invariant. 

The statements we have just made are known as Sylvester's theorem. 

Our new invariant r does not (in general) describe the geometry 
of V completely. It does so, however, in one important case. Assume 
that every positive clement of k is a square of an element of fc. This 
holds, for instance, if k = R, the field of real numbers. If we replace 
each basis vector A< by a suitable a<A< we may assume that for the 
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new basis vectors we have A* = I for i < r and A 2 , = 1 for 
i > r + 1. The geometry is based on the quadratic form 



We get n + 1 such geometries since r = 0, 1, , n are the possible 
values for r. The point is that Sylvester's theorem assures us that 
these geometries are distinct. 



CHAPTER IV 
The General Linear Group 

1. Non-commutative determinants 

J. Dieudonn6 has extended the theory of determinants to non- 
commutative fields. His theory includes the case of ordinary deter- 
minants and we shall present it here. Let A: be a field which is not 
necessarily commutative. Let A = (a<,) denote an n-by-n matrix 
with elements in k where the letter i designates the row and j the 
column of A. For any i 9* j and any X c k, we denote by J?,,(X) the 
matrix obtained from the unit matrix by replacing the element 
a,/ = of the unit matrix by X. 

To multiply any matrix A on the left by # t -,(X) amounts to adding 
to the i-ih row of A the j-th row multiplied on the left by X. To 
multiply A on the right by #,(X) amounts to adding to the j'-th 
column of A the i-th column multiplied on the right by X. Specifically, 
B,,(X)J?<,GO = tl .(X + M), and B f ,(X)- 1 = B. f (-X). 

If A i , A 2 , , A n are the row vectors of A, then the row vectors 
of BA are left linear combinations of the A v . A is non-singular if it 
has an inverse #, i.e., if the n unit vectors can be written as left 
linear combinations of the A, . This is the case if and only if the 
A, are left linearly independent. 

The group of all non-singular n-by-n matrices is called the general 
linear group and denoted by GL n (/c). The matrices #,,(X) (for all 
i ? j and all X e k) generate a subgroup, SL n (k), called the unimodular 
group. Its elements shall be called unimodular matrices. 

Let A be non-singular. We are going to multiply A on the left by 
various B fi (\) and achieve an especially simple form. We shall then 
have multiplied A on the left by a certain unimodular matrix B. 

Since A is non-singular, not all elements a,-i of the first column 
can be zero. If n > 2 and a 2 i = 0, we add a suitable row to the second 
row and obtain a matrix with a 2l ^ 0. Now we multiply the second 
row by (1 a n )aai and add it to the first row. This gives a new 
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matrix with a n = 1. Now we multiply the first row by a tl and sub- 
trlact it from the i-th row (i > 1). We have then a non-singular 
matrix with a u = 1 and o tl = for i > 1. 

The n 1 rows A 2 , , A n of this matrix are left linearly inde- 
pendent; thus we can treat the second column in a similar way 
and obtain a 22 = 1, ? = for i > 2 if n > 3. But we can also 
achieve a i2 = by subtracting a multiple of the second row from 
the first. 

We are stopped in this procedure only when the matrix differs 
from the unit matrix just by the last column. The element a nn = /z 
must then be 7 0, otherwise the last row would contain only zeros. 
Therefore, we can at least achieve a tn = for i < n 1, by sub- 
tracting multiples of the last row from the others. We end up with 
a matrix D(n), differing from the unit matrix only in the element 
a nn which is /z. Thus we have 

THEOREM 4.1. Every non-singular matrix A can be written in 
the form S-Z)(/z) with B e SL n (k) and some /z 7^ 0. 

To multiply A on the left by D(IJL) amounts to multiplying the 
last row from the left by /x- To multiply A on the right by 
amounts to multiplying the last column of A on the right by 
Specifically, 



The elements /^ of k form a multiplicative group /c*._Its factor 
commutator group is abelian and shall be denoted by k*. To this 
group, we adjoin a zero element with obvious multiplication, and 
call the semi-group thus obtained k. Every a 7* of k has a canonical 
image d__in k. With_the zero element^ of k^ we associate the zero 
element of k. Then ab = ab = bd= ba and 1 is the unit element of k. 
We frequently write simply 1 instead of 1 and instead of 0. Caution 
is necessary j^vith 1 since it may happen (e.g., in the quaternions) 
that -1 = 1. 

We associate now with every matrix A an element of k called 
determinant: dot A. We shall ask that det A satisfies the following 
axioms: 

1) Suppose A' is obtained from A by multiplying one row of A 
on the left by /i. Then 

det A 7 = /zdet A. 
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2) If A' is obtained from A by adding one row to another, then 

det A' = det A. 

3) The unit matrix has determinant 1. 

Since we are going to prove the existence of determinants by 
induction on n, we first derive some consequences of the axioms: 

a) If we add to the row A t a left multiple XA, of another row, 
the determinant does not change. 

Proof: This is obvious if X = 0. Let X 9* 0, and A' be the matrix 
obtained from A by replacing A, by XA, . Then det A = X" 1 det A'. 
We now add the row XA, of A' to A t and factor out X from thej-th 
row. 

b) If A is singular, then det A 0. Indeed, one row vector is a 
left linear combination of the others. Subtracting this linear com- 
bination from it, we get a matrix with one row = 0. Assume A has 
this form. We factor out from this row and have det A = 
det A = 0. 

c) If A, arid A i are interchanged, then det A is multiplied by 1. 
Proof: Replace A t by A t + A,- ; subtract this from A, (obtaining 

A { ) and add this new row to the t'-th row. This interchanges A, 
and A, but with a sign change. Factor out 1. 

d) dot 7)(ju) = jz. Factor out /z from the last row and use axiom 3). 

e) If A is non-singular and of the form B-D(n) with unimodular 5, 
then det A = /z. This follows from d), since multiplication by B 
means repeated operations which do not change the determinant. 

f) The axioms are categorical. This follows from b) and e). 

g) det A = if and only if A is singular. This follows also from 
b) and e). 

h) det(A#) = det A- det B. 

Proof: If A is singular, AB can not have an inverse, since ABC = / 
(where / is tho unit matrix) implies that BC = A" 1 . Therefore, 
det A = det A B = 0, and our formula holds. 

If A = CD(p) is non-singular where C is unimodular, then 
det A = /Z. The matrix D(p,)-B is obtained from B by multiplying the 
last row by /z. Therefore, det(D(n)B) = /z det B. Multiplying D()B 
on the left by the unimodular matrix C does not change the deter- 
minant. Hence, 

det(AJB) = det(DOi)B) = M det B = det A det B, 
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i) Since det(A-B t ,(\)) = det A, it follows that det A does not 
change if a right multiple of one column is added to another. 

j) If a column of A is multiplied on the right by ju, then the 
determinant is multiplied by /z. This follows for the last column 
from det(AD(/i)) = det A -/z and for the others by column exchange. 
The rule for column exchange follows from i) as in c); one obtains 
it at first only if the last column is interchanged with another. 
This is enough to prove j) and one can now prove the full rule for 
column exchange. 

Existence of determinants 

1) For one-rowed matrices A = (a), we set det A = a, and see 
that the axioms are trivially satisfied. Let us suppose, therefore, 
that determinants are defined for (n 1) -rowed matrices and that 
the axioms are verified. 

2) If A is singular, put det A = 0. A being singular means that 
the rows are dependent. Then axioms 1) and 2) are true since the 
new rows are also dependent. 

3) If A is non-singular, then the row vectors A, are left linearly 
independent, and there exist unique elements X t e k such that 
^-1 X,A, = (1, 0, - - ,0) (therefore, not all X t = 0). We write 
A, = (0,1 , B.) where 5, is a vector with n 1 components. Then 

X F a.t = 1 and XJ3, = 0. 



Consider now the matrix F with n rows B { (and n 1 columns). 
Call d the (n l)-by-(n 1) matrix obtained from F by crossing 
out the t-th row. We wish to get information about det C, . 

a) If X, = 0, then ^,-1 \ V B, = shows that the rows of C t are 
dependent, and consequently det C, = 0. 

b) Suppose X, 5^ and X,- 5^ (i 7* j). Call /) and E the matrices 
obtained from C t by replacing E, by X ; B y and B t , respectively. 

We have: det d = X^ 1 det D. If we add to the row X,B,- all the 
other rows multiplied on the left by X, , we obtain in this row 
Hrn, ^*B> = XA Factoring out X< , we get 
det C, = Xf r (- X<)det E. 

With \i j\ 1 interchanges of adjacent rows, we can change the 
matrix E into the matrix <?,- . Hence we have 



det C, - (-I)'" 1 - 1 XJdet C, , 
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or 

(4.1) (-1) <+1 Xf 1 det C< = (-l)' +l Xf 1 det C, . 

The expression (4.1) is, therefore, the same for any X, 5^ and shall 
be called the determinant of A : 

det A = (-1)'* 1 Xf 1 det C, . 

Next we have to prove our axioms. 

1) Suppose A< is replaced by jiA. . If n = 0, the axiom holds 
trivially. Let n j 0. We have then to replace X t by X./x" 1 and to 
keep the other X, fixed^If X t ^ 0, then C t is not changed; the factor 
X"; 1 is replaced by p, X7 1 , and the axiom is true. If X, 5* for v j i, 
then det C, changes by the factor /z, and the axiom is again true. 

2) If At is replaced by A t + A, , then X, is replaced by X,- X< 
and all other X, stay the same; for, 

X,(A, + A t ) + (X,- - X)A, = X.A, + X,-A, . 

If some X, is ^ for v 7* i, j, we use it for the computation of the 
determinant and see that det C F does not change since a row is 
added to another. If X 5^ 0, then X t arid C, stay the same. There 
remains the case when X, -^ but all other X, = 0. Then X,J5, = 0; 
therefore, B, = 0. Since B t has to be replaced by B t + B f , no 
change occurs in C, ; X,- also does riot change, and our axiom is 
established in all cases. 

3) For the unit matrix, we have X t = 1, all other X, = 0. d is 
again the unit matrix whence det 1=1. 

THEOREM 4.2. Let a, 6 e k and j* 0. Let c = aba' 1 !!)" 1 . Then 
D(c) is unimodular ifn>2. 

Proof: We start with the unit matrix and perform changes which 
amount always to adding a suitable left multiple of one row to another. 
The arrow between the matrices indicates the change: 

/i o\_^A ( A->/' ~ a \_>/ ~~ a \ 

\o i/ V 1 i/ V 1 i/ V 1 6-V 

/aba' 1 \ /aba" 1 \ /O -c \ 
\ a- 1 6"V \ 1 b- 1 / \1 b~) 

-c i-c 3-c :) 
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The steps are only written out in 2-rowed matrices and are meant 
to be performed with the last two rows of n-rowed matrices. They 
show that D(c) is obtained from the unit matrix by multiplication 
from the left by unimodular matrices, and our theorem is proved. 

We can now answer the question as to when det A = 1. If we 
put A = BD(IJL) with an unimodular 5, then det A = /z. But /z = 1 
means that /* is in the commutator group, therefore a product of 
commutators. This shows that D(p,) is unimodular and, hence, A. 

If n = 1, det A = det(a) = a. To get a uniform answer, we may 
define SL^k) to be thejjpmmutator group of fc*. 

The map GL n (k) k* given by A > det A is a homomorphism 
because of the multiplication theorem for determinants. The map is 
onto since /)(M) > /z, and since /z can be any element of k*. Its 
kernel is SL n (k). SL n (k) is, consequently, an invariant subgroup 
of GL n (k) whose_f actor group is canonically (by the determinant map) 
isomorphic to k*. Thus we have 

THEOREM 4.3. The formula det A = 1 is equivalent with A e SL n (k). 
SL n (k) is an invariant subgroup of GL n (k}] it is the_kernel of the map 
A > det A, and the factor group is isomorphic to k*. 

THEOREM 4.4. Let 

A = 

where B and D are square matrices. Then 

det A = det B -dot D. 
The same is true for 





Proof: 

1) If B is singular, its rows are dependent and, therefore, the 
corresponding rows of A. Our formula holds. 

2) If B is non-singular, we can make unimodular changes (and 
the same changes on A) to bring it into the form D(JJL). Then 
det B = /z. Subtracting suitable multiples of the first rows of A 
from the following, we can make C = 0: 
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If D is singular, so is A. If D is non-singular, we can assume that 
D is of the form D(p) where det D = p. Factoring out of A the ju 
and the p, we are left with the unit matrix, and det A = /zp = 
det B-dGiD. 

For A in the second form, we can argue in the same way starting 
with D. 

Examples for n = 2 



1) 



2) 



0; 



u p 
7 S 


~~~ 


1 


7 o 
j8 


= -1 7 


= -7/3- 




a 8 


= 


Of 

5 


- 7~'0 


f -l/> 

= ao aya p 



3) Suppose a/> 5^ ba, then 
1 a 
fc a& 



= ab - ba 7* 0. 



This shows that the right factor 6 of the last row can not be factored 
out since otherwise we would get 0. 

We shall now investigate the substitutes for the linearity of 
determinants in the commutative case. 

Let fc' be the commutator group of the non-zero elements of fc. 
The element a can then be interpreted as the coset a/c', and we have 
trivially: ab = db ab. Define addition as follows: a + b = ak' + bk' 
is the set of all sums of an element of a and an element of 5. Then 
a + b t a + 6; therefore, (a + b)c C ac + be. Notice that such a 
relation means equality in the commutative case since then a = a, 
and a + 6 = a + 6. 

Consider now the determinant as a function of some row, say 
the last one, A n , keeping the other rows, A t , fixed. Denote this 
function by D(A n ). 

THEOREM 4.5. 



D(A n 



D(A' n ). 
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Proof: The n + 1 vectors AI , A 2 , , A n , Ai are linearly de- 
pendent; this yields a non-trivial relation 

Pl A, + - + pn-iA^ + \A n + AZ = 0. 

1) If both X and /i are 0, the first n 1 rows of our three deter- 
minants are dependent; the determinants are 0, and our theorem 
holds. 

2) If, say, A j& 0, we can assume X = 1. Adding suitable multiples 
of the first n 1 rows to the last, we find 

D(A n ) = D(-/*A3 = 
and 



From 



we get our theorem. 

COROLLARY. If k is commutative, the determinant is a linear 
function of each row. 

2. The structure of GL n (k) 

We shall now derive some lemmas about fields which will be 
necessary later on. Let us denote by 

k any field, 

Z the center of k which is a commutative subfield of A, 

S the additive subgroup of k which is generated by the products 
of squares of /c; an element of S is, therefore, of the form 
S =t ^2 x* . Notice that S + S C S, S- S C S but also that 
c c S implies c' 1 = c- (c" 1 ) 2 e S if c ^ Q. 

LEMMA 4.1. // the square of every element of k lies in Z, then k 
is commutative. 
Proof: 

1) Since xy + yx = (x + y) 2 x 2 ?/ 2 , we know that every 
element of the form xy + yx belongs to Z. 

2) If the characteristic of k is j* 2, we may write for x e k 
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which shows that x t Z. 

3) If k has characteristic 2 and is not commutative, we can find 
two elements a, b e k such that ab 7* ba and thus c = a& + 6a 7* 0. 
By 1) we know c e Z and ca = a(ba) + (bd)a is also in Z. This implies 
a e Z and, hence, ab = 6a which is a contradiction. 

LEMMA 4.2. Unless k is commutative and of characteristic 2, we 
have S = k. 
Proof: 

1) Formula (4.2) proves our lemma if k does not have the charac- 
teristic 2. 

2) Suppose k has characteristic 2 and is not commutative. Then 
xy + yx = (x + 2/) 2 - x 2 y 2 shows xy + yx t S. 

By Lemma 4.1 there exists an element a c k such that a 2 4 Z. We 
can, therefore, find an element b z k such that a 2 b 7* ba 2 . The element 
c = a*b + ba 2 will be * and in S. Let x be any element of k. Then 

cs = a 2 bx + ba 2 x = a 2 (6z + xb) + (a*x)b + b(a*x) 

shows that ex t S and, hence x t c~ l S C S. 

Let V be a right vector space of dimension n > 2 over k. In Chapter 
I we have shown that the set Hom(F, V) of fc-linear maps of V into 
V is isomorphic to the ring of all n-by-n matrices with elements in k. 
This isomorphism depended on a chosen basis A l , A 2 , , A n . 
If (T e Hom(V, F) and crA/ = S?-i A v a 9i , then the elements a vi 
form the j'-th column of the matrix (a,-/) associated with <r. We 
define the determinant of a by 

det ff = det(a t ,). 

If the basis of V is changed, then A = (a*/) is replaced by some 
BAB' 1 and det(BAB~ l ) = det B-det A- (det B)~ l = det A, since 
the values of determinants are commutative. The value of det cr 
does, therefore, not depend on the chosen basis. Clearly det(or) 
det (7'det T. 

It is preferable to redefine the group GL n (k) as the set of all non- 
singular fc-linear maps of V into V and the subgroup SL n (k) by the 
condition that det a = 1. We shall try to understand the geometric 
meaning of SL n (k). 
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DEFINITION 4.1. An element r t GL n (k) is called a transvection 
if it keeps every vector of some hyperplane H fixed and moves any 
vector X e V by some vector of H : rX X z II. 

We must first determine the form of a transvection. The dual 
space t^ of V is an n-dimensional left vector space over k. It can be 
used to describe the hyperplanes of V as follows: if <p j is an 
element of ^, then the set of all vectors Y e V which satisfy <p(Y) = 
form a hyperplane H of V] if \f/ e F describes the same hyperplane, 
then ^ = a? where c e k. 

Suppose our hyperplane H is given by p e t^. Select a vector 
B of V which is not in H] i.e., p(J3) = a 7* 0. 

Let X be any vector of V and consider the vector X J3-o~V(^)- 
Its image under <p is ^(X) <p(B)a~ l <p(X) = which implies that 
it belongs to H. Hence the given transvection r will not move it: 

rX - TB-a- l <p(X) = X - 
rX = X + (rCBO - Ifo 

Since Ba~ l is moved by T only by a vector of H it follows that 
A = r(Ba~ l ) /ia" 1 e // and we can write 

(4.3) rX = X + A-<p(X), <p(A) = 0. 

Select conversely any $ e ^ and any A e F f or which ^>(A) = 0; 
construct the map r by (4.3) which (in its dependency on A) shall 
be denoted by T A . 

If <p or A are zero, then T is identity, a transvection for any hyper- 
plane. Suppose <f> and A are different from 0. We will get rX = X 
if and only if <p(X) = 0, i.e., if X belongs to the hyperplane H defined 
by <f>. The vector A satisfies <p(A) = and belongs, therefore, also 
to H. An arbitrary vector X is moved by a multiple of A, i.e. by a 
vector of //. The map r is non-singular since rX = entails that X is a 
multiple of A and hence X e //; but then T X = X and consequently 
X = 0. We see that (4.3) gives us all transvections; they are much 
more special than the definition suggests in that a vector X is always 
moved by a vector of the line (A) C H which we shall call the direc- 
tion of the transvection. The notion of direction makes, of course, 
sense only if the transvection T is different from 1. 

Let A t B t H. We may compute T A T B : 
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= T B X + A-<p(r B X) 
= T B X + 



since <p(B) = 0. We obtain 

TA+B 



Let r F^ 1 be the transvection described by (4.3) and a any element 
of GL n (k). Set r' = crro-" 1 ; then 



r'(X) = crr^X) = (KcT'X + A-<p(<r-*X)) = X + aA^^X). 

The map X > <p(<r~ l X) e k is also an element of ^. To find its 
hyperplane we remark that <p(<r~ l X) = is equivalent with cr~ l X e H, 
hence with X e aH. The vector <rA belongs to vH and we see that 
r 1 = err' 1 is again a transvection which belongs to the hyperplane 
aH and whose direction is the line (<rA). Let conversely 

r'X = X + A'-^(X) 

be a transvection 5^ 1 and f/' its hyperplane. We find two vectors 
B and B' of K Avhich satisfy <p(B) = 1 and t(B') = 1. 

We determine an element a e GL n (V) such that ovl = A', o-/f = 77' 
and <rB = J9'; this can be done sinco a basis of H together with B 
will span V and similarly a basis of //' together with B' will also 
span V. Set r" = orcr" 1 . Then 

r"(X) = X + A'-^eT'X). 



There is a constant c such that <p((r~ 1 X) = c^(X) since <p(<r~ l X) 
also determines tho hyperplane aH = //'. Setting X = B', o-^X = B, 
we obtain c = 1 and find r" = r'. All transvections r 5^ 1 are con- 
jugate in GL n (k). From r' = error" 1 we get also that det r' = det r, 
i.e., that all transvections ^ 1 have the same determinant. 

We had shown that T A T B = T A B , where A and B are vectors 
of H. Suppose that H contains at least three vectors. Select A ?& 
and then B ^ 0, A. Setting C = A+B^OwG have T A T B = r c 
and all three transvections are 9* 1. They have the same determinant 
a 5^ and we get a 2 a, a = 1. The formula can also be used for 
another purpose. Let / be the canonical map of GL n (k) onto its 
factor commutator group. The image of GL n (k) under / is a commuta- 
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tive group so that /(<JTO- -I ) = f(o)f(T)f(o)~ l = /(r) showing again 
that all transvections have the same image under the map /. We 
again have /3 2 = and, therefore, ft = 1, the image of a transvection 
is 1, a transvection lies in the commutator subgroup of GL n (k). We 
must remember, however, that we made the assumption that H 
contains at least three vectors. This is certainly the case if dim H > 2 
or n > 3; should n = 2, dim H = 1, then it is true if k contains 
at least three elements. The only exception is, therefore, GL 2 (F 2 ); 
the result about the determinant holds in this case also since 1 is 
the only non-zero element of F 2 but the transvections of GL 2 (F 2 ) 
are not in the commutator group, as we are going to see later. 

Let us, for a moment, use a basis A l , A 2 , , A n of V. Let r 
be the map which corresponds to the matrix B,-,-(X); r leaves A, 
fixed if v j j. These A, span a hyperplane H. The remaining vector 
Aj is moved by a multiple of A< , hence by a vector of H. An arbitrary 
vector of V is, therefore, also moved by a vector of H; i.e., r is a 
transvection. We know that the special transvections J5 t ./(X) generate 
the group SL n (k). Thus the group generated by all transvections 
of V will contain SL n (k). But we have just seen that every trans- 
vection is unimodular, i.e., has determinant 1. It follows that SL n (k) 
is the group generated by all transvections of V and this is the geo- 
metric interpretation of the group SL n (k). 

Except for GL 2 (F 2 ) we proved that SL n (k) is contained in the factor 
commutator group of GL n (k). The factor group GL n (k)/SL n (k) is 
commutative (SL n (k) is the kernel of the determinant map) which 
implies the reverse inclusion. The group SL n (k) is the commutator 
group of GL n (k) and the factor commutator group is canonically 
isomorphic (by the determinant map) to the factor commutator 
group of fc*. 

If r(X) = X + A-<p(X) and r'(X) = X + A'-}(X) are two 
transvections j& 1 and if B and B' are vectors which satisfy <p(B) = 1 
respectively \I/(B') = 1, then select any a which satisfies aH = H', 
<rA = A 1 and <rB = B 1 proving that r' = oro-" 1 . 

If n > 3, we have a great freedom in the construction of <r. The 
hyperplanes H and H' contain vectors which are independent. 
Changing the image of a basis vector of H ' which is independent of 
A' by a factor in k we can achieve that det <r = 1, i.e., that o- c SL n (k). 
All transvections j& 1 are conjugate in SL n (k). 

If n = 2, hyperplane and direction coincide. If we insist merely 
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on ffH = H', then we can still achieve det a = 1, since B' could be 
changed by a factor. Let now r range over all transvections 5^ 1 
with hyperplane H. Then r f = CTT<T~ I will range over transvections 
with hyperplane H 1 and any given r 1 is obtainable, namely from 
r = (7~W. We see that the whole set of transvections j& 1 with 
hyperplane H is conjugate, within &L B (fc), to the whole set of trans- 
vections ^ 1 with some other hyperplane H'. 
We collect our results: 

THEOREM 4.6. The transvections 7* 1 of V generate the group 
/SL n (/c). They form a single set of conjugate elements in GL n (k) and t 
if n > 3, even a single set of conjugate elements in SL n (k). If n = 2, 
then the whole set of transvections 7* 1 with a given direction is conjugate, 
within SL n (k)j to the whole set of transvections 7* 1 with some other 
direction. 

THEOREM 4.7. Unless we are in the case GL 2 (F 2 ) the commutator 
group of GL n (k) is SL n (k) and the factor commutator group is canon- 
ically isomorphic (by the determinant map) to the factor commutator 
group of k*. 

THEOREM 4.8. The centralizer of SL n (k) in GL n (k) consists of 
those a e GL n (k) which keep all lines of V fixed. They are in one-to-one 
correspondence with the non-zero elements a of the center Z of k. If <r a 
corresponds to a t Z*, then a (X) = Xa. This centralizer is also the 
center of GL n (k). The center of SL n (k) consists of those cr a which have 
determinant 1, for which a n = 1. 

Proof: Let (A) be a given line and T a transvection with this 
direction. If a is in the centralizer of SL n (k), then it must commute 
with T and oro-" 1 = r. The left side of this equation has direction 
(vA) the right side (A). It follows that aX = Xa for any X e V 
but a may depend on X. If Xi and X 2 are independent vectors, 
then Xi , X 2 , (Xi + X 2 ) are mapped by cr onto X^a, X 2 pj (X^ + X 2 )y. 
Since X^ + X 2 p is also the image of X l + X 2 , we find that 
a = 7 = 0. If X l and X 2 are dependent, then we compare them to a 
third independent vector and find again that they take on the same 
factor. The map a a (X) = Xa is not always linear; it should satisfy 
<r a (Xp) = cr a (Z)j3 or Xfta = Xa/3 showing a e Z*. The rest of the 
theorem is obvious. 

We come now to the main topic of this section. We shall call a 
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subgroup G of GL n (k) invariant under unimodular transformation 
if <rG<r~ l C G for any a t SL n (k). We shall prove that such a subgroup 
G is (with few exceptions) either contained in the center of GL n (k) 
or else will contain the whole group SL n (k). This will be done by 
showing that G contains all trans vections if it is not contained 
in the center of GL n (k). The method will frequently make use of the 
following statement: If a e SL n (k) and r e (?, then crro-'V" 1 is also 
in G since it is the product of oro-" 1 and of r" 1 ; rar'V" 1 and similar 
expressions lie in G as well. 

LEMMA 4.3. Let G be invariant under unimodular transformations 
and suppose that G contains a transvection r j 1. We can conclude 
that SL n (k) C G unless n = 2 and at the same timek is a commuta- 
tive field of characteristic 2. 

Proof: 

1) Let n > 3 and let a range over SL n (k). The element oro-" 1 
ranges over all transvections 5^ 1 and, consequently, SL n (k) C G. 

2) Let n = 2, (A) be the hyperplane (and direction) of r, and B 
be some vector independent of A. Then V = (/I, B) and r may 
be described by its effect on A and B: 

(4.4) r(A) = A, r(B) = A\ + B. 

Conversely, any r of this form is a transvection with hyperplane 
(A). If we wish to denote its dependency on A, then we shall write T\ . 
Select <r t GL 2 (V) given by 

(4.5) <r(A) = Act, <r(B) = Aft + By. 

Applying o-" 1 to (4.5) we obtain A = (r~ 1 (A)a, <r~ l (A) = AoT 1 and 
B = <r- l (A)0 + <r~ l (B)v, hence3, o-' 1 ^) = -Aa~ l fty' 1 + By' 1 . 
Obviously (r 1) A = and (T l)B = ^4X, and hence we have 
a(r - 1)<7~M = and a(r - l)^' 1 ^ = ^A-Xy' 1 ) = Aa\y~ l . 
Therefore 

(4.6) (rTcT'A = A, <TT<T- I B = Aa\y~ l + B. 

This computation shall also be used a little later. 

If we want a unimodular, then the determinant of the matrix 



(a P\ 

\0 7/' 



which is 07, should be 1. This is certainly the case if we set 
y = X'Vx^A. For this choice of 7, (4.6) becomes 
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<TT<r~ l A = A, <rr<r- l B = A-a\ + B 

which is the transvection r a ^ and lies also in G. 

The set T of all X in k for which r x e G contains elements 7* 0. If 
Xi , X 2 e T, then rx.r^ 1 = r Xl *x a c G showing that T 7 is an additive 
group. We have just seen that for any a e k* the set c?T is contained 
in T. Hence c^a\ - a*T C T and, therefore, ST C T where S is 
the set of Lemma 4.2. By our assumption about k and Lemma 4.2 
we conclude that S = k and, consequently, S2 7 = kT = k. G con- 
tains all transvections with the direction (A). Theorem 4.6 shows 
that G contains all transvections and, consequently, the whole 
group SL 2 (k). 

LEMMA 4.4. Suppose that n = 2 and that G contains an element 
a whose action on some basis A, B of V is of the type given by formula 
(4.5). We can conclude that SL 2 (k) C. G if either y is not in the center 
of k or, should y be in the center, if a ^ y. 

Proof: Let T be the transvection given by (4.4). The element 
p = 



(?T(7~ l T~ l is also in G. For T~ I we find 



r-\A) = A, r~\B) = -A\ + B. 
Making use of (4.6) we obtain for the action of p: 

p(A) = A, P (B) = A(a\y~ l - X) + /?. 

We have a free choice of X at our disposal and try to select it in 
such a way that a\y~ l X 5^ 0; for such a choice, p would be a 
transvection 7^ 1 of G and Lemma 4.3 would show that SL 2 (k) C G 
unless k is a commutative field with characteristic 2. The choice 
X = 1 will work unless a = 7; by our assumption y is not in the 
center if a = y and there exists a X such that y\ XT 5^ 0, hence 
(y\ \y)y~ l = a\y~ l X 5* 0. There remains the case where k is 
commutative with characteristic 2. The element y is then in the 
center, and, by our assumption, a 5^ y. The factor is a\y~ l X = 
(ay" 1 1)X and will range over & if X ranges over k. The element p 
will then range over all transvections with direction (A). By Theo- 
rem 4.6 all transvections are in G, hence SL 2 (k) C G. 

THEOREM 4.9. Suppose that either n > 3 or that n = 2 but that 
k contains at least four elements. If G is a subgroup of GL n (k) which is 
invariant under unimodular transformations and which is not contained 
in the center of GL n (k), then SL n (k) C G. 
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Proof: 

1) n = 2. The group G must contain an element cr which moves 
a certain line (A). Set <rA = B, then V is spanned by A and J? and 
the action of <r may be described in terms of the basis A, B: 

aA = B, <rB = Aft + By, ft j* 0. 

We select any non-zero element a in k and define an element r 
of (?L 2 (fc) by 

rA = A/ya Sa, rB = Aa" 1 . 
It belongs to the matrix 

\-a 

which has determinant 1 so that r t SL 2 (k). This implies that p 
T~ l <r~ l T(r is also in G. We find 



err A = l?7a A0a Bya = A0a, crrB = BaT 
hence, for the inverse of or 

T- l <r~ l A = -Aa~ l p- 1 , r^a^B = Ba. 
For TO- we obtain 

TO- A = Aa" 1 , roB = A7aj8 Baft + Aa~*y 
= A(7a/9 + a'SO - Baft 
and can now compute p = ^V'Vo-: 



a~V) - Ba*ft. 

We try to select a in such a way that Lemma 4.4 can be used on p. 
Is a choice of a possible such that a 2 ft is not in the center of k? If ft 
is not in the center, choose a = 1. If ft is in the center of fc, but if k 
is not commutative, we choose a in such a way that a 2 and, therefore, 
a 2 ft is not in the center; Lemma 4.1 assures us that this can be 
done. Thus non-commutative fields are settled. If k is commutative, 
we try to enforce that a 2 ft j& a' 1 ft' 1 a" 1 which is equivalent with 
a 4 5^ ft" 2 . The equation re 4 = ft" 2 can have at most four solutions; 
if our field has at least six elements, then we can find an a 5^ such 
that a 4 ^ ft" 2 . This leaves us with just two fields, F 4 and F . 
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In 7^4 there are three non-zero elements which form a group, there- 
fore a 8 = 1, a 4 = a if a j& 0. All we need is a j& ft" 2 which can, of 
course, be done. 

In F 5 we have a 4 = 1 for any a 7* and we see that our attempt 
will fail if /T 2 = 1,0 = 1. Should this happen, then we select 
a in such a way that a 2 ft = 1; this is possible since I 2 = 1, 2 2 = 1 
in P\ . The action of the corresponding p is given by 

P (A) = -A, P (B) = -A-27-J3. 
Since p e (7, we have p 2 t G and 

P 2 A = A, p 2 B = A-47 + B; 

if 7 T^ 0, then p 2 j& 1 is a transvection and Lemma 4.3 shows 
SL 2 (k) C G. 

This leaves us with the case fc = F 5 ,/3=dbl,7 = 0. 

If ft = 1, then <rA = B, <rB = A. Change the basis: C = A + B, 
D = A - B. Then 

aC = C, ffD = -D. 

If = -1, then o-A = B, o-B = -A and we set C = A + 2B, 
Z> = A - 2B. Then 

aC = B - 2A = -2C, <rZ) = 5 + 2A = 2D. 

In both cases we can use Lemma 4.4 on a-. This finishes the proof 
if n = 2. 

2) n > 3. Select an element or in (?, which moves a certain line 
(A), and set o-A = J5. Let T j 1 be a transvection with direction (A). 
The element p = <TT<T~ I T~ I is again in G and is the product of the 
transvection r~ l with the direction (A) and the transvection ore-"" 1 
with the direction cr(A} = (B). These directions are distinct, whence 
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The transvection r~ l will move a vector X by a multiple of A 
and the transvection o-ro-" 1 will move r~ l X by a multiple of B. It 
follows that p will move X by a vector of the plane (A, B). Since 
n > 3, we can imbed (A, B) in some hyperplane H. Then pff C H, 
since a vector X tHis moved by a vector of (A, B) C H. The element 
p e G has the following properties: 

a) p 5* 1, b) pff = ff, c) P X - XtH for all X e F. 
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Suppose first that p commutes with all transvections belonging 
to the hyperplane H. Select an element <p E F which describes H 
and let C be any vector of H. Let n be the transvection 

*X = X + C-<p(X). 

We find v(pX) = <f>( P X-X) + ?(X) = <p(X) because of property c). 
We obtain 

= pX + pC-<p(X), 

= P X + C-<p(pX) = P X + 



Since we assume right now that pr t = TIP, a comparison implies 
that pC = C because X can be selected so that <p(X) j 0. But C is an 
arbitrary vector of H\ i.e., the element p leaves every vector of H 
fixed. Together with property c) this implies that p is a transvection. 
By Lemma 4.3 we conclude that SL n (k) C G. 

We may assume, therefore, that there exists a transvection r l 
with hyperplane H such that pri ^ r l p and, consequently, X = 
pr\p~ l ri l 7* 1. The element X is again in G and is a product of the 
transvections ri" 1 and prip" 1 whose hyperplanes are H respectively 
pH = H. The element X is itself a transvection, since it is a product 
of transvections with the same hyperplane. By Lemma 4.3 we have 
again SL n (k) C G and our proof is finished. 

Theorem 4.9 sheds some light on the definition of determinants. 
Suppose that we have in some way given a definition of determinants 
which works for non-singular elements of Hom(F,F), i.e., for 
elements of GL n (k). The only condition that we impose on deter- 
minants shall be det <JT det cr-det r but the value of a determinant 
may be in any (possibly non-commutative) group. This new deter- 
minant map is then a homomorphism of GL n (k) and the kernel G of 
this map an invariant subgroup of GL n (k). Let us leave aside the two 
cases GL 2 (F 2 ) and GL 2 (F 3 ). We would certainly consider a determinant 
map undesirable if G is contained in the center of GL n (k), since our 
map would then amount merely to a factoring by certain diagonal 
matrices in the center. In any other case we could conclude 
SL n (k) C G and this implies that the new determinant map is weaker 
than the old one which has SL n (k) as the precise kernel. 

DEFINITION 4.2. Let Z be the center of the group SL n (k). 
The factor group SL n (k)/Z is called the projective unimodular 
group and is denoted by PSL n (k). 
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THEOREM 4.10. The group PSL n (k) is simple] i.e., it has no 
invariant subgroup distinct from 1 and from the whole group, except for 

Proof: Let F be an invariant subgroup of PSL n (k). Its elements 
are cosets of Z . Let G be the union of these cosets; G is an invariant 
subgroup of SL n (k). If G C Z , then F = 1; otherwise G will be 
SL n (k) and hence F = PSL n (k). 

3. Vector spaces over finite fields 

Suppose that fc = F q , the field with q elements. Our groups 
become finite groups and we shall compute their orders. 

Let A l , A 2 , , A n be a basis of V and o- e GL n (k). The element 
<r is completely determined by the images /?, = ovl, of the basis 
vectors. Since o- is non-singular, the B i must be linearly independent. 
For B l we can select any of the q n 1 non-zero vectors of V. Suppose 
that we have already chosen B l , B 2 , , E i (i < n); these vectors 
span a subspace of dimension i which will contain #* vectors. For 
B i+1 we can select any of the q n q 1 vectors outside of this subspace. 
We find, therefore, 

n 

(q n - l)(f -<?) (q n - gT 1 ) = g <- 1)/2 fj ( g * - 1) 

possibilities for the choice of the images. This number is the order 
of GL n (k). 

The group SL^k) is the kernel of the determinant map of GL n (k) 
onto the group fc* = fc* (fc is commutative by Theorem 1.14) and fc* 
has q 1 elements. The order of SL n (k) is, therefore, obtained by 
dividing by q 1, hence simply by omitting the term i = 1 in the 
product. 

The order of the center of SL n (k) is equal to the number of solu- 
tions of the equation a n = 1 in fc*. The elements of fc* form a group 
of order q 1, they all satisfy x q ~* = 1. Let d be the greatest common 
divisor of n and q 1. Then we can find integers r and s such that 
d = nr + (q l)s. 

If a n = 1, then a d = (<x n ) r - (a q ~ 1 )' = 1; conversely, a* = 1 implies 
a n = 1, since d divides n. We must now find the number of solutions 
of the equation a d = 1 (where d divides q 1). The equation 
x q ~ l 1 has q 1 distinct roots in fc, namely all elements of fc*. 
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It factors into q 1 distinct linear factors. The polynomial x d 1 
divides x'~ l 1 and this implies that x* 1 factors into d distinct 
linear terms. The order of the center of SL n (k) is, therefore, d. 

To get the order of PSL n (k) we have to divide the order of SL n (k) 
byd. 

THEOREM 4.11. The order of GL n (K) is g n(n - 1)/2 JJ?-i (?' - 1), 
that of SL n (k) is q n( -' 2 JJ;., ( q < - 1) and the order of PSL n (k) is 
(1/d) g n(n ~ l)/2 JJ^Ig (</* 1) where d is the greatest common divisor of 
n and q 1. We know that PSL n (k) is simple unless n = 2, q = 2 
or n = 2, q = 3. 

For n = 2 we obtain as orders of PSL n (k) : 

q(q* - 1) ., . ,, 

:m -r if q is odd, 

& 

q(q* 1) if q is even. 

For q = 2, 3, 4, 5, 7, 8, 9, 11, 13 we find as orders 6, 12, 60, 60' 
168, 504, 360, 660, 1092. 

If V is of dimension n over F Q , then V contains q n 1 non-zero 
vectors. On each line there are q 1 non-zero vectors, so that V 
contains (q n \}/(q 1) lines. An element of SL n (F Q ) may be 
mapped onto the permutation of these lines. The kernel (if the 
lines are kept fixed) is the center of SL n (F g ). This induces a map 
of PSL^FJ onto the permutations of the lines which now is an 
isomorphism into. 

If n = 2, q = 2, 3, 4 we have 3, 4, 5 lines respectively. We see: 

1) GL 2 (FJ SL 2 (F 2 ) = PSL 2 (F 2 ) is the symmetric group of 
3 letters. It is not simple and SL 2 (F 2 ) is not the commutator group 
of GL 2 (F 2 ) which is cyclic of order 3. 

2) PSL 2 (Fz) is the alternating group of 4 letters. It is not simple 
either. 

3) PSL 2 (F 4 ) is the alternating group of 5 letters. The group 
PSL 2 (F 6 ) has also order 60 and it can be shown to be isomorphic 
to PSL 2 (FJ. 

Other interesting cases: 

4) PSL 2 (F 7 ) and PSL 3 (F 2 )have both order 168. They are, in 
fact, isomorphic. 

5) PSL 2 (F 9 ) has order 360 and is isomorphic to the alternating 
group of 6 letters. 
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6) The two simple groups PSL 3 (F 4 ) and PSL 4 (F 2 ) have both the 
order 20160 which is also the order of the alternating group A 8 
of 8 letters. One can show that A s ~ PSL^(F 2 ). Since it is very 
interesting to have an example of simple groups which have the same 
order and are not isomorphic, we will show that PSL 3 (F 4 ) and 
PSL/i(F 2 ) are not isomorphic. 

The non-zero elements of F 4 form a cyclic group of order 3. It 
follows that the cube of each element in the center of GL 3 (F 4 ) is 1. 
Take an element a of PSL 3 (F 4 ) which is of order 2; cr is a coset and 
let cr be an element of SL 3 (F 4 ) in this coset. a? = 1 means a 2 is in 
the center of /SL 3 (F 4 ), therefore o- 6 = 1. The element <r 3 lies in the 
coset a 3 = & and we could have taken cr 3 instead of a. Making this 
replacement we see that the new cr satisfies <r 2 = 1. 

Let H be the kernel of the map (1 + a)X = X + <rX. Since 
V/H ~ (1 + <r)7, we obtain 

3 = dim V = dim H + dim(l + (r)F; 

dim H = 3 would imply (1 + <r)V = 0, X + <rX = for all X e V 
hence <rX = X (the characteristic is 2). But this would mean a- = 1 
whereas we assumed a of order 2. Consequently dim // < 2. X e H 
means X + <rX = or crX = X\ H consists of the elements left 
fixed by a-. The elements in (1 + a-) V are left fixed, since <r(X + <rX) = 
crX + a*X = (rX + X which implies (1 + <j)F C H and conse- 
quently dim(l + <r)V < dim H. This leaves dim H = 2 as the only 
possibility, H is a hyperplane and cr moves any X by <rX + X t 
(1 + cr)7 C II] * is, therefore, a transvection ^ 1. Since n = 3, any 
two such transvections 5^ 1 are conjugate and this shows that the 
elements of order 2 in PSL^F*) form a single set of conjugate 
elements. 

Let us look at PSL 4 (F 2 ) = SL 4 (F 2 ) = GL 4 (F 2 ). Among the 
elements of order two we have first of all the transvections 5^ 1 
(5 12 (1) 2 = -812(0) = 1) which form one class of conjugate elements. 
Let A l , AI , A 3 , A 4 be a basis of V and define T by 

= A l , r(A 2 ) = A l + A 2 , 
= A 3 , r(A 4 ) = A 3 + A 4 . 

We have r 2 = 1 but T is not a transvection since A 2 and A 4 are 
moved by vectors of different lines. Thus the elements of order 2 
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form at least two sets of conjugate elements and PSL 3 (F 4 ) and 
PSL^(F 2 ) are not isomorphic. 

It can be shown that no other order coincidences among the 
groups PSL n (F q ) or with alternating groups occur aside from the 
ones we have mentioned. 



CHAPTER V 



The Structure of Symplectic and Orthogonal 

Groups 



In this chapter we investigate questions similar to those of the 
preceding chapter. We consider a space with either a symplectic 
or an orthogonal geometry and try to determine invariant subgroups 
of the symplectic respectively orthogonal group. This is easy enough 
in the symplectic case and one may also expect a similar result for 
the orthogonal groups. However it turns out that not all orthogonal 
groups lead to simple groups. The problem of the structure of the 
orthogonal groups is only partly solved and many interesting ques- 
tions remain still undecided. We begin with the easy case, the one 
of the symplectic group. 

1. Structure of the symplectic group 

We shall prove the analogue to Theorem 4.4. 

THEOREM 5.1. Let G be an invariant subgroup of Sp n (k) which 
is not contained in the center of Sp n (k). Then G = Sp n (k) except for 
the cases Sp. 2 (F 2 ), Sp 2 (F 9 ) and Sp 4 (F 2 ). 

Proof: 

1) Let A e V, A 7* and suppose that G contains all trans vections 
in the direction A. If B is another non-zero vector, select T e Sp n (k) 
such that rA = B. If a ranges over the transvections in the direction 
A, then rcrr~ l ranges over the transvections in the direction B so 
that G contains all transvections. We have G = Sp n (k) in this case. 
If k = F 2 , there is only one transvection 9* I with direction A and 
if fa = F 3 t there are two, but they are inverses of each other. In the 
cases k = F 2 or k = F 3 it is, therefore, sufficient to ascertain that G 
contains one transvection ?* 1. 

2) If n = 2, V = (AT, M) where N, M is a hyperbolic pair, let 
<rN = aN + pM, vM = yN + 5M. The only condition a must 

173 
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satisfy to be in Sp 2 (k) is aN-aM = N-M = 1 which implies 
ad #7 = 1. This shows that Sp 2 (k) = SL 2 (k), the two-dimensional 
unimodular group of V. Theorem 4.9 proves our contention if n = 2. 
3) G must contain some <r which moves at least one line (A). 
Let <rA = B and assume at first that AB = 0. Since (A) 5^ (B), 
we have (A)* ? (B)*. The hyperplane (B)* is, therefore, not con- 
tained in (A)* and we can find a vector C e (B)*, C 4 (A)*. Then 
CB = but CA 7* 0; we can change C by a factor to make CA = 1. 
Let T be the transvection 

rX = X + ((C - A)X)-(C - A). 

Then T A = A + (C - A) = C and rB = B. 

The element p = ro-'V" 1 ^ is in G and we find p(A) = ro^V^S) = 
rar~ l (B) = r(A) = C. Since AC 7* 0, we see that it is no loss of 
generality to assume of our original a that a (A) = B and AB 9* 0. 

Let TI be the transvection r^X = X + (AX) -A. The element 
Pi = TjorrrV 1 belongs to G and we obtain 

(5.1) Pl (B) = T^rr'U) = TK7(A) = ^=5 + (AB)A. 

The transvection TI leaves every vector orthogonal to A and the 
transvection cr^V" 1 every vector orthogonal to B fixed. The product 
Pi will keep every vector orthogonal to the plane P = (A, B) fixed. 
This plane is non-singular, since AB j 0; we can write V = P JL P* 
and find that PI = p 2 -L lp* where p 2 e Sp 2 (k). Equation (5.1) 
shows that p 2 does not leave the line (B) of the plane P fixed, i.e., 
that p 2 is not in the center of the group Sp 2 (k). The set of all X e Sp 2 (k) 
for which X JL IP* belongs to G is an invariant subgroup of Sp 2 (k) 
which is not in the center of Sp 2 (K). If k 7* F 2 , F 3 this set will, 
therefore, be all of Sp 2 (k). In particular, if we take for X all trans- 
vections of P in the direction of (A), we see that G contains all 
transvections X JL lp* of V in the direction of (A). This proves that 
G = Sp n (k). 

If k F 3 , we have n > 4. We imbed P in a non-singular space 
F 4 of dimension 4, the element p 2 has the form p a _L 1 F4 * where 
Pa c Spi(k) and we could argue as in the preceding case, provided 
our theorem is proved for the group Sp 4 (k). Similarly, if k = F 2 , 
it suffices to prove the theorem for the group Sp 6 (k). 

We must, therefore, consider the two special cases Sp^(F z ) and 
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4) Suppose k = F 2 or F 3 , and assume first that the element PI 
leaves some line (C) of the plane P fixed. If k = F 2 , then the vector 
C itself is kept fixed, p l is a transvection of P, hence of 7, and our 
theorem is true in this case. If k = F 3 , then P! may reverse C. Setting 
P = (C, D), we obtain 



-C, 

since the effect of PI must be unimodular. The element is not 
since pi moves the line (A) of P. For pj we obtain 

p?C = C, p?D = D - 2fC 

and this is a transvection. 

We may assume that p l keeps no line of P fixed. If k = F 2 , then 
there are three non-zero vectors in P and PI permutes them cyclically; 
pr 1 will give the other cyclic permutation. If Q is another non-singular 
plane and r t Sp Q (F 2 ) such that rP = Q, then the elements TP* I T~ I 
are also in G and permute the non-zero vectors of Q cyclically, leaving 
every vector orthogonal to Q fixed. 

If k = F a , let (A) be any line of P. Setting p^A = B we have 
P = (A, B) and the unimodular effect of pi on P is given by 

Pl A = B, Pl B = -A + /3B. 

We contend that = 0; otherwise we would have p t (A + #6) = 
B - A + p 2 B = -0A - B = -0(A + 0B) (remember = 1 
in F 3 ) showing that the line (A + @B) is kept fixed. We see that 

Pl A = B, Pl B = -A, p?A = -B, p\B = A. 

Since AB = 1, one of the pairs A, B and B, A is hyperbolic. 
The group G contains, therefore, an element a which has the following 
effect on a symplectic basis NI y M l , N 2 , M 2 of V: 

<rNi = MI , <rMi = NI , <rAT 2 = AT 2 , <rM 2 = M 2 . 



We shall now investigate /Sp 4 (F 3 ) and Sp*(F 2 ) separately. The 
following type of argument will be used repeatedly: If A l , A 2 , , A n 
and B t , B 2 , , B n are two symplectic bases of V, then there exists 
are Sp n (k) such that rA t - = B< . If we know that <r e G and that 
<rA f = X)?-i flt>^ > then ror" 1 is also in G and 

n 

T(7T~ 1 B t = TO-A,- = 2.) irS, . 
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5) Sp 4 (F 3 )] let NI , Mi ; AT 2 , M 2 be a symplectic basis of V, then 
(5.2) AT, , N 2 + M, ; AT 2 , M 2 + AT, 

is also a symplectic basis. The group G contains an element a which 
has the following effect: 



(5.3) <rtf, = M, , (rM t = -AT, , dV a = AT 2 , <rM 2 = M 3 . 

[t also contains an element r having a corresponding effect on the 
basis (5.2): 

T #, = N 2 + M l , r(N 2 + MO = -AT, , rN 2 = AT, , 
(5.4) 

r(M 2 + AT,) = M 2 + AT, . 

We find, for or, 

+ M = - M\ , 

O = M 2 + M, , 



= AT t - AT 2 + M! + M 2 . 

The element p = (or) 2 is also in G and has the form 
Nt = AT, , P M, = M, , P AT 2 = A^ 2 , P M 2 = 



But this is a transvection and the proof is finished. 
6) Spt(F 2 )', we are going to use the following symplectic bases: 

(5.5) N, , M, ; # 2 , M 2 ; AT 3 , M 3 , 

(5.6) Nt , M , + M 2 ; ^ + AT 2 , M 2 ; AT 3 , Af, , 

(5.7) AT, , Jlf t ; A^ 3 , M 3 ; AT 2 , M 2 , 

(5.8) N 19 M 1 ; N 2 + N 3J M 2 ; AT 3 , M 2 + M 3 . 
Let (T! be the cyclic interchange (on the basis (5.5)) 

AT 2 - A^ 2 + M 2 -> M 2 -> AT 2 

keeping NI , MI 9 N 3 , M : > fixed. 
Similarly, let a 2 be the change (on the basis (5.6)) 
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M 2 -> N l + N, + M 2 -+ N } + N 2 



keeping N l , M l + M 2 , JV 3 , M 3 fixed. 
Set 0-3 = 0- 2 0"i . We find 



= Mj + Nt + N 2 , <r 3 M 2 = M 2 + N l , (r 3 M 3 = M 3 

and the vectors AT t are kept fixed. 

Call o- 4 , o- 5 the elements which have an effect similar to <7 3 on the 
basis (5.7), respectively (5.8). They keep the vectors AT, fixed and 
we obtain 



, = M, + N, + AT 3 , cr 4 M 2 = M 2 , 



The element r = o- 3 o- 4 o- 5 lies also in G Y , keeps the vectors N t fixed 
and since 

rM 1 = Mj + AT, , rM 2 = M 2 , rM 3 = M 3 , 
it is a transvection. This completes the proof. 

DEFINITION 5.1. The factor group of Sp n (k) modulo its center 
is called the projective symplectic group and denoted by PSp n (k). 

THEOREM 5.2. The groups PSp n (k) are simple except for PSp 2 (F 2 ), 



The center of Sp n (k) is identity if the characteristic of k is 2, it is 
of order 2 if the characteristic is j 2. In Chapter III, 6, we have 
determined the order of Sp^Fy). If we denote by d the greatest 
common divisor of 2 and q 1, then we find, as order of 



a 



- fl (" - 

- 



The order of the simple group PSp^(F 3 ) is 25920. This group 
turns up in algebraic geometry as the group of the 27 straight lines 
on a cubic surface. 

The order of the simple group PSp^(F 2 ) is 1451520. It also arises 
in algebraic geometry as the group of the 28 double tangents to a 
plane curve of degree four. 
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2. The orthogonal group of euclidean space 

We specialize k to the field R of real numbers. Let V be an n-dimen- 
sional vector space over R with an orthogonal geometry based on 
the quadratic form 

xl + xl + + xl . 

Then we say that V is a euclidean space. Let us first look at the 
case n = 3. No isotropic vectors are present, the geometry on each 
plane P of V is again euclidean so that any two planes PI and P 2 
of V are isometric. P 2 = rPi for some T E 0"J . It follows that all 
180 rotations are conjugate. Since they generate Oa we can say: 
If an invariant subgroup G of Oa contains one 180 rotation, then 
G - 0+3 . 

Now let G 7* 1 be an invariant subgroup of Os and <r ^ 1 an 
element of G. We look at the action of G on the unit sphere of vectors 
with square 1; <r has an axis (A) where A 2 = 1. In geometric language 
the argument is very simple: or rotates the sphere around the axis (A) 
by a certain angle. Let d be the distance by which a vector Y on 
the equator (AY = 0) is moved. The other vectors of the sphere are 
moved by at most d, and to a given number m < d one can find a 
point Z t which is moved by the distance m (into a point Z 2 ). If PI , P 2 
are any points on the sphere with distance m, then we can find a 
rotation r e Oa such that rZ x = Pj , rZ 2 = P 2 The rotation ror"" 1 
will also be in G and will move PI into P 2 . This shows that a point 
P! can be moved by an element of G into any point P 2 provided the 
distance of PI and P 2 is < d. By repeated application of such motions 
one can now move P, into any point of the sphere, especially into 
P! . Hence G contains an element <TI which reverses a certain 
vector. But in Chapter III when we discussed the case n = 3 
we saw that <TI must be a 180 rotation. Thus G = 3 . 

Some readers might like to see the formal argument about the 
point Zj : set Z l = A- Vl - p 2 + Y-p where < /* < 1. Then 



(cr - 1)Z, = M .(o- - 1)7, (Or - 

which makes the statement obvious. 

The group 0\ modulo its center is not simple, but we can show 

THEOREM 5.3. Let V be a euclidean space of dimension n > 3, 
but n 7* 4. If G is an invariant subgroup of On which is not contained 
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in the center of 0\ , then G = 0\ . This means that + n is simple for 
odd dimensions > 3; for even dimensions > 6 the factor group of + n 
modulo its center d=l v is simple. 

Proof: We may assume n > 5. Let or be an element of G which 
is not in the center of 0(V). If we had <rP = P for every plane 
P of V, then we would also have <rL = L for every line L of F, since 
L is the intersection of planes. Let P be a plane such that aP j* P 
and X the 180 rotation in the plane P. Then (rXo-" 1 is the 180 
rotation in the plane <rP and consequently 5* X. The element 
p = o-Xo-^X"" 1 is also in G, is 5* 1, and will leave every vector ortho- 
gonal to P + 0P fixed. Since n > 5, a non-zero vector A exists 
which is orthogonal to P + <rP. Since pA = A, we have p j* \ v ; 
since p 5^ l v , we know that p is not in the center of 0(V). 

The element p must move some line (5); set pB = C. Denote by 
T A the symmetry with respect to the hyperplane (A)*. The product 
M = r B r A is a rotation. We have pr^p" 1 = r pX = r A and pr B p~ l = r c 
and, hence, pjup" M~ = T C T A TA TB = TCT B ; the element o-i = r c r 5 
lies in G, is 5^ 1 since (C) 7* (B) and leaves every vector orthogonal 
to the plane Q = (B, C) fixed. If U is a three-dimensional subspace 
of V which contains Q, we may write <TI = PI A. lv where PI e l(U) 
and pi 5^ 1. The group 0* 3 (U) is simple and we conclude that G 
contains all elements of the form p 2 _L 1#* where p 2 tO + z(U). Among 
these elements there are 180 rotations and thus the proof of our 
theorem is finished. 

3. Elliptic spaces 

If the reader scrutinizes the preceding proof for n 3 he will 
notice a very peculiar feature: In concluding that "small" displace- 
ments on the unit sphere can be combined to give arbitrary displace- 
ments we have used the archimedean axiom in the field of real 
numbers. The following example shows that the use of the archi- 
medean axiom is essential. 

Let fc be an ordered field which contains infinitely small elements 
(elements > which are smaller than any positive rational number). 
The field of power series in / with rational coefficients is an example 
of such a field if we order it by the "lowest term" (see Chapter I, 
9). Take again n = 3, and base the geometry of V on the quadratic 
form x 2 + y 2 + z 2 . It is intuitively clear that the "infinitesimal 
rotations" will form an invariant subgroup of Oa . Hence Ot will 
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not be simple. We devote this paragraph to a generalization of this 
example. 

Suppose that a valuation of k is given whose value group will 
be denoted by S (an ordered group with zero-element). We assume S 
to be non-trivial (which means S does not only consist of and 1) 
and to be the full image of k under the valuation. 

Let V be a space over k with a non-singular orthogonal geometry 
and assume n = dim V > 3. 

Let A l , A 2 , , A n be a basis of F, g if = A,A, and (ha) the 
inverse of the matrix (g lt ). Let 



X = A 9 x v , Y = A.y. 

v-\ v-1 

be vectors of V. We have 



hence 

(5.9) x< = Z 

r-l 

We define a "norm" \\X\\ of X with respect to our valuation by 

(5.10) ||Z|| = Max \x t \ 

t 

and get from (5.9) (if Si t S is the maximum of |A tJ |) 

(5.11) ||X|| < srMax \XA.\. 

V 

If s 2 = Max,., |(7,, |, then XY = 2".<-i ff.^.?/. shows 

(5.12) \XY\ < S2 |m|.||F||. 

DEFINITION 5.2. V is called elliptic with respect to our valuation, 
if there exists an s 3 e S such that 

(5.13) \XY\ 2 <s, \X 2 \-\Y 2 \ 
for all X, Y t V. 

REMARK. Suppose that V contains an isotropic vector N j 0. 
Let N, M be a hyperbolic pair. Then (5.13) can not hold for X = N, 
Y = M and we see that V is not elliptic. If V does not contain iso- 
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tropic vectors 5* 0, then it may or may not be elliptic with respect 
to our valuation. 

LEMMA 5.1. V is elliptic if and only if there exists an s 4 c S such 
that \X 2 \ < 1 implies \\X\\ < s. . 
Proof: 

1) Suppose V is elliptic and \X 2 \ < 1. If we put F = A, in (5.13) 
we obtain that \XA,\ 2 < s 3 |A 2 |, hence, from (5.11), that 
I|<X"I| 2 < sis* Max, \A 2 V \ < si for a suitably selected s 4 . 

2) Suppose |X 2 | < 1 implies ||X|| < s 4 . Then ||7|| = s 4 would 
certainly imply |K 2 | > 1. Let X be any non-zero vector. Find a e k 
such that \\X\\ = \a\ and b e k such that s 4 = |6|. The vector (b/d) X 
has norm s 4 and, consequently, \((b/a)X) 2 \ > 1 or s 2 \X 2 \ > \\X\\ 2 . 
For all X e V we have, therefore, \\X\\ 2 < si \X 2 \. From (5.12) we 
get now 

\XY\ 2 <stsi\X 2 \-\Y 2 \, 
which implies that V is elliptic. 

LEMMA 5.2. Suppose that V contains a vector A ^ such that 
there exists an s 5 t S satisfying 

\XA\ 2 <s> \X 2 \. 

Then V is elliptic. 

Proof: If o- ranges over all isometrics, then the vectors vA will 
generate a subspace of V which is invariant under all isometrics. 
By Theorem 3.24 this subspace must be V itself. We may, therefore, 
select a basis A, = a t A of V from this set. Then 

\XA<\ 2 = IrfX-^Af = K^-AI 8 < s & l^r 1 ^) 2 ! = s 
and from (5.11) 



Thus if \X 2 \ < 1, then ||X|| 2 < s 2 s 5 < si for a suitably selected 4 . 
V is elliptic. 
The following definition still works for any space. 

DEFINITION 5.3. An isometry o- of V is called infinitesimal of 
order s (s j 0, s e S) if 

- Y)\ 2 <s |X 2 |.|7 2 | for all X, Y E V. 
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We shall denote by G, the set of all infinitesimal isometrics of a 
given order s. Clearly identity lies in (?. . 

THEOREM 5.4. G. is an invariant subgroup of 0(V). 

Proof: 

1) Let cr, T e G. , then 

\X(*rY - F)| = \X(*rY - rY) + X(rY - Y)| 

rY) - rY)|, 



which shows or e (7. . 
2) Let cr t G. , then 

|X(er l r - 7)| 2 = \<rX(Y - 



so that <r~ l tG, . 

3) Let <r e G. , T e 0(7), then 
\X(rar~ l Y - 7)1' = \T- l X(<rr- l Y - r^Y)? < 



which proves that G 9 is an invariant subgroup of 0(V). 
THEOREM 5.5. As s decreases the sets G, decrease and we have 

n<?. = 1. 



Proof: The first statement is obvious. To prove the last it suffices 
to exhibit a G t which does not contain a given a ^ 1 . 

Select a Y such that orY Y 7* and an X such that 
X(aY - Y) * 0. Then 



- F)| 2 <S \X*\-\Y*\ 
will be false for a small enough s E S. 

THEOREM 5.6. // V is not elliptic, then (?. = I for any s e S. 
Proof: It suffices to show that if <r 5^ 1 is contained in some G, , 
then V is elliptic. 
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Select Y so that A = aY - F ^ 0. Then 

\XA\ 2 <s\X 2 \-\Y 2 \ = s 5 \X 2 \ 

for all X e F and a suitable s 5 . Lemma 5.2 shows that V is elliptic. 
We may, therefore, assume from now on that V is an elliptic space. 

THEOREM 5.7. // V is elliptic, then every G t contains elements 
a * 1. 

Proof: The idea is the following. Denote by T A the symmetry 
with respect to (A)* and let A and B be vectors which are independent 
but "very close"; then T A ^ T B , hence a = T B T A j^ 1 and a should 
be "nearly" identity. For a proof we would have to estimate 

X(*Y - Y) = X(r B r A Y - F) = r B X(r A Y - r B Y) 

in terms of \Y 2 \ and \X 2 \ = \(T B X) 2 \. It is, therefore, enough to 
estimate X(r A Y - r B Y) in terms of \X 2 \ and |F a |. 
For T A Y we have a formula: 



One sees this by observing that the right side is a linear map which 
is identity on (A)* and sends A into A. Therefore 



and 

(5.14) X(r A Y - r B Y) = -~ (A\BX)(BY) - B\AX)(AY)). 

We select a fixed A ^ and a fixed C independent of A. Choose 
t\ j in A; and set B = A + yC which will achieve T A ^ r B . This 
77 will be chosen "very small". 

We have B 2 = A 2 + 2<n(AC) + ri 2 C 2 and if \rj\ is small enough 
|A 2 | is the maximal term; from A 2 = B 2 2ij(AC) ij 2 C 2 one sees 
that for small |T?| the term |B 2 | must be maximal, since A 2 ^ 0. 
This shows \A 2 \ = |JB 2 | so that the factor 2/A 2 B 2 of (5.14) is merely 
a constant in absolute value and may be disregarded. 

If B = A + rjC is substituted in the parentheses on the right 
side of (5.14) the term A 2 (AX)(AY) will cancel and a polynomial in 
17 will remain which has a factor 77. Factoring out this 77, the coefficients 
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of the remaining polynomial in t\ will be, apart from constant factors, 
products of either (AX) or (CX) by either (AY) or (CY). This 
shows that we can find an s' e S such that for small |ij| we have 



\X(r A Y - r B Y)\ < S '-M-Max(|AX|, \CX\).Max(\AY\, \CY\). 
Since the space is elliptic, we obtain 

\AX\ 3 < 8 3 \A*\ \X 3 \, \CX\ 3 < 8 3 |C 2 | \X 3 \ 
and see that an s" t S can be found such that 

\X(T A Y- T B r>i<s"\i\-\x 3 \-\Y 3 \. 

All we have to do is to select |TJ| small enough. 

THEOREM 5.8. // |s| < |4|, then G. does not contain isometries a 
which reverse some non-zero vector. It follows that then G, C 0*(V), 
that G 9 does not contain any element of even order and (if V is elliptic) 
that G. n fl(F) 5* 1. 

Proof: Suppose a A = - A, A 7* and V elliptic. Put X = Y = A. 
Then 

|X(o-y- 7)| 2 = |4|-|AT- |4|.|X 2 |.|7 2 | 

which shows the first part of the contention. 

If a is a reflexion and n = dim V is odd, then a is a rotation. 
Since n is odd, a keeps some non-zero vector fixed, hence <r will 
reverse it. If n is even and or is a reflexion, a is a product of at most 
n 1 symmetries and keeps, therefore, a vector B ^ fixed. V is 
elliptic, B 2 ? 0. The reflection <r has the form 1 (1?> J_ r where T 
is a reflexion on the odd-dimensional space (B)*. It follows again 
that a reverses some non-zero vector. 

If a has order 2r, then a r is an involution ^ 1 and will reverse some 
vector; <r r can not be in G, , hence a can not be in it either. It follows 
that if a e G. and a 7* 1, then o- 2 c G. and o- 2 ^ 1. But <r 2 e 12(F). 

The groups G 9 C\ fi(F) form now a descending chain of invariant 
non-trivial subgroups of fl(F). Therefore, if V is elliptic, 12(F) is not 
simple either. 

THEOREM 5.9. Ifcr e G, , T c (?, , then the commutator <TTV~ I T~ I e G tt 
Proof: We have \X(<? - 1)7| 2 < 5 |X 2 |-|F 2 |. Put X = (<r - 1)7. 
We obtain |((o- l)Y) 2 \ < s \Y 2 \ and consequently 

\X(* - l)(r - 1)F| 2 < s \X 2 \.\((r - 1)F) 2 | < st |X 2 
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and 



Since <JT TO- = (<7 l)(r 1) (T l)(<r 1) we obtain 



If we replace F by <7~V 1 F, we get 

l - 1)F| 2 <st\X 2 \.\Y 2 \ 



and this proves our contention. 

This theorem is significant only if * < 1, / < 1. We mention as a 
special case that for s < 1 the commutator of two elements of G. 
lies in (r.. so that the factor group (?./(?. is abelian. 

THEOREM 5.10. // V is elliptic and s is sufficiently large, then 
G. = 0(7). 
Proof: Since we assume V elliptic we have 

\XY\ 2 <* 3 \X 2 \-\Y 2 \ and \X-<rY\ 2 <s 3 \X 2 \.\Y*\, 

hence \X(*Y - F)| 2 < s 3 \X 2 \ -|F 2 | which shows that any <r c (?.. . 

It would be of interest to know the structure of factor groups 
also f or s > 1. 

To give some examples let us turn to the one we gave at the 
beginning of this section. If k is an ordered field which is non-archi- 
medean we can associate with the ordering a valuation as described 
in Chapter I, 10. The finite elements of k are those which satisfy 
|a| < 1. 

We had based the geometry on the quadratic form x 2 + y 2 + z 3 . 
If \x 2 + y 2 + z 2 \ < 1, then x 2 + y 2 + z' 2 < m where m is an integer. 
Hence < x 2 < x 2 + y 2 + z 2 < m. It follows that \x 2 \ < 1 and, 
consequently, |x| < 1. This shows that \X 2 \ < 1 implies \\X\\ < 1. 
Our geometry is elliptic. 

To give a second example let k be the field Q of rational numbers 
and base the geometry again on the quadratic form x 2 + y 2 + z 2 . 
Select as valuation of Q the 2-adic one, where |a| < 1 means that a 
has an odd denominator. Let d be the least common denominator of 
x, y, z and x = r/d, y = s/d, z == t/d. Suppose |x 2 + y 2 + z*\ = 
|(r 2 + s 2 + / 2 )/d 2 | < 1. This implies that (r 2 + s 2 + t*)/d* can also 
be written with an odd denominator. We contend that d itself must 
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be odd. Indeed, if d were even, then r 2 + s 2 + f would have to be 
devisible by 4. One sees easily that this is only possible if r, s and / 
are even. But then d would not be the least common denominator 
of x, y, z. Thus we have shown that \x\ < 1, \y\ < 1, \z\ < 1 and 
\X 2 \ < 1 implies again that ||X|| < 1, our geometry is elliptic. 



4. The Clifford algebra 

Let F be a vector space with a non-singular orthogonal geometry. 
For deeper results on orthogonal groups we shall have to construct 
a certain associative ring called the Clifford algebra C(V) of V. 

We must first introduce some elementary notions of set theory. 
If S l , >S 2 , , S r are subsets of a given set M we define a sum 
Si + S 2 + + S r which will not be the union. It shall consist 
of those elements of M which occur in an odd number of the sets S t . 
The following rules are easily derived: 

(Si + S 2 + + S r ) + S r +i = Si + S 2 + + S r+ i , 
(Si + S 2 + - + S r ) n T 

= (Si r\ T) + (S 2 n r) + + (s r r\ r). 

The empty set shall be denoted by 0. 

Let now A l , A 2 , , A n be an orthogonal basis of V. The sets 
in question are the 2 n subsets of the set M = {1, 2, , n}. We 
construct a vector space C(V) over k of dimension 2 n with basis 
elements e s , one e a for each subset S of M . 

In C(V) a multiplication denoted by o is defined between the 
basis elements e s and extended to all of C(V) by linearity. The 
definition is: 



The symbol (s, /) is a sign; it shall be +1 if s < / and 1 if s > /. 
The term A* is the ordinary square of the basis vector A { of V and 
is, therefore, an element of k. If 3, T or S H T are empty, then 
"empty products" occur in the definition; they have to be inter- 
preted as usual, namely as 1. 

This definition will become understandable in a short while. 



CHAPTER V 187 

The main problem is the associativity. We have 
(e. er) oe R = fl ( 



jtS + T 



and we shall rewrite the right side in a more symmetric form. 

First the signs: If we let./ range not over S + T but first over all 
of S and then over all of T, then any j e S C\ T will appear twice. 
No change is produced in this way since (j, r) 2 = 1. The signs can, 
therefore, be written as 



tS tS ttT 

ttT rtR rtR 



which is a more satisfactory form. 

Now the A* : We have (S + T) H R = (S H R) + (T C\ R). 
If v belongs to all three sets S, T, 72, then it is in S C\ T but not in 
(S n R) + (T n 72) /If y belongs to S and T but not to 72, then 
it is in fl n T but not in (8 Pi 72) + (T Pi 72). If v lies in S and 72 
but not in 7 7 , or in T and 72 but not in S, then it is not in S C\ T 
but in (S Pi 72) + (TH 72). If, finally, v is in only one of the sets 
5, T 1 , 72 or in none of them, then v will neither be in S C\ T nor in 
(S n 72) + (T H 72). We must, therefore, take the product over 
those Al for which v appears in more than one of the sets S, T, 72. 

The form we have achieved is so symmetric that (e s o e T ) o e R = 
e s (e T 0) is rather obvious. Our multiplication makes C(F) into 
an associative ring, the Clifford algebra of V. One sees immediately 
that e o e a = es o e = e s which shows that C(V) has the unit 
element e^ which we also denote by 1. The scalar multiples of e+ : 
k + shall be identified with k. 

We shall also identify V with a certain subspace of C(V) as follows: 
the vector A t is identified with the vector e [t] (associated with the 
set {zj containing the single element i). The vector X = X)"-i x^A, 
of F is identified with the vector Jj"-i ^m f ^(^0- We must 
now distinguish the old scalar product XY in F and the product 
X o Y in C(V). We find 



If i 3^ j, then 

(A.- o A,-) + (Aj o 4.) = (,;>,,.,, + (j, t>i..fi = 0. 
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Let X = <-i x t A. , Y = X, t/,A,- ; then 



(X o Y) + (Y o X) = * t2 /,((A, o A,) + (A,, o A,)) 

i.i-l 



where (-XT) is the old scalar product in V. The rule 

(5.14) (X o F) + (7 o X) = 2(XY) 
has two special cases. The commutation rule 

(5.15) (X o Y) = -(FoX) 
if Jf and Y are orthogonal, and 

(5.16) X oX = X 2 . 

Let >S C M be non-empty. Arrange the elements of S in an in- 
creasing sequence: ^ < i 2 < < i r . Then 

e-s = A tl o A,-, o ... o A tr , 

as one shows easily by induction. The vectors A, are, therefore, 
generators of the Clifford algebra. We also understand now the 
definition of the multiplication e s e T : if one writes e s and e T as 
products of the basis vectors A, , one has to use the commuta- 
tion rule a certain number of times and then possibly the rule (5.16) 
in order to express e s o e T as multiple of e s f. T - The result would be 
our definition 1 . 

C(V) contains two subspaces C + (V) and C~(V) of dimension 2 n ~ 1 . 
C*(V) is spanned by the vectors e R for which S contains an even 
number of elements, C~(V) by those e R for which S contains an odd 
number of elements. If both S and T contain an even number or 
if both contain an odd number of elements, then S + T contains 
an even number. If one of them contains an even and the other an 
odd number of elements, then S + T will contain an odd number of 
elements. This leads to the rules: 



C C\V), C~(V) o C'(V) C 
C~(V) o C\V) C C-(F), C+(V) o C'(V) C 



1 It is now also easy to sec that C(V) does not depend on the choice of the or- 
thogonal basis of V. 



CHAPTER V 189 

Obviously, C(F) = C + (V) C~(V). Since 1 = e+ belongs to C+ (F) 
this subspace is a subalgebra of C(V) with a unit element. It is 
clear that C*(V) is generated by those e r for which T contains two 
elements. 

Each basis vector e s has an inverse which is just a scalar multiple 
of it, as is obvious from 

e s o e s = I! (i > $2) E[ 4? fc *- 

i , 1 1 5 1 3 

The product e 5 o e r differs only in sign from e T e s .If we denote 
by [/S] the number of elements in S, then we find for the sign 



We may write this result in the form 



We shall study the sign change which e T produces more closely. 
Let e s be a given basis element and T range over all sets with two 
elements, [T] = 2; is there a T which produces a sign change? We 
find as condition that [S r\ T] = 1, i.e., that one element of T 
should be in S and the other not. Such a T will exist unless S is 
either or the whole set M . 

An element a. t C(F) can be written in the form a = ^ s ysts 
with 7s c k. We wish to find those a which are in the centralizer 
of C*(V), and which, therefore, commute with all e T where [T] = 2. 
Then e T a & = a or 



For a given S 7* 0, M, one can find an e r which produces a sign 
change, hence y s = for these S. The centralizer of C + (7) is, there- 
fore, 



= k + fa* . 
C (F) is a subalgebra of C(F) with a very simple structure. We find 

\n<n-l)/2 ^2 j 2 J 2 / i\n(n-l)/2ynf 

) AI A 2 A n = ^ 1) Cr 



where (? is the discriminant of V. 

If n is odd, then e M is not in C + (V) and the center of C*(F) reduces 
to *. If n is even, then C (F) C C + (F), C (F) is the center of C* (F). 
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To find the center of C(V) we must see whether e M commutes 
with all e T where [T] = 1. One finds that this is the case if and only 
if n is odd. The center of C(V) is, therefore, k if n is even and C (V) 
if n is odd. 

The question which elements of C*(V) commute with all elements 
of V has a uniform answer: they are the elements of k in all cases. 
Indeed, they must commute with every element of C(V) and must lie 
in(T(F). 

We shall also have to find out which elements a = ^ s y s es 
of C(V) anticommute with every vector of V, i.e., satisfy a o A> = 
(A t o a) for all i. If 7$ ^ 0, then e T e s o e^; 1 must equal e s 
for all T with \T] = 1. This gives as condition that [S] - [S n T] 
must be odd for all such T. Only S = M could possibly do this 
and then only if n is even. The answer to our question is, therefore, 
a = if n is odd and a e ke M if n is even. 

If a is an element of C(V) which has an inverse, then one abbre- 
viates a o o a" 1 by the symbol a . The map C(V) > C(F) given 
by * a is an automorphism of C(V) which satisfies (*)" = a/J . 
We denote by R(V) the set of all a. which map the subspace V of 
C(V) into itself, i.e., for which X e V implies X" e V. Let a be such 
an element and let us apply the map to both sides of equation (5.14). 
The right side, 2(XY), remains fixed, being an element of k\ the 
left side becomes (X a o Y") + (Y* o X") which is 2(X"Y"). We 
obtain (XY) = (X*Y) which shows that a induces an isometry on 
Vj i.e., an element <r a of the group 0(V). 

The map a > o- is a homomorphism of the group R(V) into 
0(V). The kernel of this map consists of those elements a of the 
center of C(V) which have an inverse. If n is even, this kernel is fc*; 
if n is odd, it is the set C*(V) of all elements of C (V) which have an 
inverse. We shall now determine the image of our map. 

Let B, X e V and B 2 ^ 0. Equation (5.14) shows (R o X) = 
2(BX) - (X o B). Equation (5.16) implies that JB" 1 = (l/B 2 )-; 
if we multiply the preceding equation on the right by B"" 1 , then 
the left side becomes X B and we obtain 



X" = B-X= -r B (X) 

where TB (X) = X - (2(BX)/B 2 ) B. The map T B : V -> F is identity 
on the hyperplane (B)* and sends the vector B into B. It is, 
therefore, the symmetry with respect to (B)*. 
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We remember that every isometry a of V is a product <r = 
TB^TB, r Br of symmetries. The fact that one can achieve r < n 
will not be used right now. If a e 0*(F), then r will be even, otherwise 
it is odd. We have found 

X* = - TB (X) 

and obtain, by raising X to the powers B r , J5 r _i , , B l successively, 
that 



If n is even, then ly is a rotation and we see that ( l) r er can 
range over all of 0(F). If o- is a rotation, then a(X) is expressed 
directly; if o- is a reflexion, the reflexion a is given by our formula. 
The map R(V) 0(V) is onto, the kernel of this map was k* and 
we see that every element of R(V) has the form 7 B l o B 2 o o B r ; 
since 7 e fc* can be united with B l we may say that the elements of 
R(V) are merely products B^ o B 2 o o J5 r of non-isotropic vectors. 
If r is even, then the element lies in C + (F), otherwise it is in C~(F). 
Rotations are only produced by the elements of C + (V), reflexions 
by those of (T(F). 

If n is odd, then l v is a reflexion and the map ( l) r <7 is always 
a rotation. This raises the question whether one can find an a. e R(V) 
such that X" = a(X) is a reflexion. Setting ft = B L o B 2 o - - - o B r 
(where r is odd) the element /3" 1 o a = 7 of R(V) would induce the 
map X y = X. But we have seen that no 7 5^ can anticommute 
with every element of V if n is odd. The result is that the image of 
R (V) consists only of rotations. These rotations can already be 
obtained from elements B l o B 2 o - - - o B r which will lie in C* (F) 
if r is even and in C~(F) if r is odd. This suggests the 

DEFINITION 5.4. An element a of either C + (F) or of C"(F) 
is called regular if a has an inverse and if a maps F onto F : X t V 
implies X" e F. 

If n is odd, then a arid induce the same rotation if and only if 
they differ by a factor of C"5(F). Should both of them be either in 
C + (F) or both in C~(F), then this factor must be in A;*, since 
M * C~(V). The regular elements are products of non-isotropic 
vectors in this case also. 

We are mainly interested in the group + (F) and can restrict 
ourselves to the regular elements of C*(F). They form a group 
which shall be denoted by D(V). We obtain a uniform statement: 
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THEOREM 5.11. The regular elements of C*(V) are products of 
an even number of non-jsotropic vectors of V. The map D(V) > 0*(V) 
given by a > <r a is onto and its kernel is k*. We have, therefore, an 
isomorphism: 0+(V) ~ D(V)/k*. 

We return to the expression 

e a =* Ai> o A t . o o A ir , ti < ti < < i r , 

S = {ii , i a , ,t r } 

for non-empty subsets S of M . Let us reverse all the factors and call 
the resulting product e : 

e' B = A ir o A.,., o ... o A,- t . 
The commutation rule shows that e& differs from e 5 only by a sign, 



In agreement with this rule we set 1 ; = e^ e+ = 1 and extend 
the map to C(V) by linearity. If a = ^ 3 yse s , then a' = X) Vses . 
Let us compare (for any At) the products 

A< o e s = A 4 o (A <4 o A,-, o o AJ 
and 

e s ' o A i = (A,- r o A,. r _ t o o A,-,) o A,- . 

In order to express the first product as a multiple of an e T one 
has to use the commutation rule a certain number of times to bring 
At into the correct position and then possibly the rule A,- o A,- = A? , 
if i occurs among the i, . 

Expressing the second product as a multiple of an CT we see that 
we have to use the same number of interchanges followed by a 
possible replacement of A, o A t - by A? . This shows: 

Should A, o e s = ye T , then 65 o A t - = ye{> . This is the special 
case of the rule 

e' a oej, = (e T oe s Y 

if T contains only one element. By induction one can now prove 
this rule for any T. 

The map J : C(V) C(V) given by a = a' is, therefore, an 
antiautomorphism of C(V). 
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One is lead to introduce a norm on C(V): 
N(a) = ao a J . 

This norm does not satisfy in general the rule N(a o ft) = N(d) o N(0). 
It does so, however, if a and ft are regular. Indeed, if 

a = B l o B 2 o - o B r , 
then 

N(a) = a o a J = (B, o B 2 o . o B r ) o (B r o B r . v o . o BJ 
and from B r o B r = B 2 , B r _ l o B,^ = B 2 -i , one obtains 
(5.17) N(a) = BlBl #efc*. 

This implies N(a o p) = o p o tf o a j = N(0) - (a o </) = N(a)N(fi 
if a and /3 are regular. 

We are now in a position to prove the main lemma: 

LEMMA 5.3. Let BI , B 2 , , B r be non-isotropic vectors of V, 

T Bt the summetry with respect to {#,)* and assume r B ^'T B9 r Br 

= 1 v . The product B\E\ - E\ is then the square of an element of fc*. 

Proof: Since l v is a rotation, we know that r is even. The element 
Bt o B, 2 o . . . o B r of C + (V) induces the identity on V and is, there- 
fore, in the kernel k* of the map D(V) > + (V). If we set 
a = B! o B 2 o . . o B r , then a e fc*. The norm of a is a o a j = 
a* a = a 2 on one hand and, by (5.17), B\B\ #? on the other. 

5. The spinorial norm 

Let a- = T Xl T X , r Ar = T 5l r fll - r^. be two expressions for 
an isometry a as product of symmetries. Then 

TA l TA % " ' ' T A r T B, T B t -t ' ' ' r B^ == 1/J 

and Lemma 5.3 implies that A\A\ A 2 r B 2 ,B*-i B\ is a square 
of fc*. The product AM' A* differs, therefore, from B\Bl - # 
by the square factor 

A? A 2 2 A?jB?B 2 2 --g; 
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Let us denote by fc* 2 the multiplicative group of squares of fc*. 
Then we have shown that the map 

(5.18) e(<i) = Al Al A*-k* 2 

is well defined. Since obviously 0((7itr 2 ) = 0((ri)0(o- 2 ) it gives a homo- 
morphism 0(V) > /c*/fc* 2 . Since the image group is commutative, 
the commutator group fi(F) is contained in the kernel. 

It is, however, not yet a reasonable map. If we replace the scalar 
product XY on V by a new product / (-XT), where t is a fixed element 
of k*, then the orthogonal group does not change, isometries are still 
the same linear maps of V. The map 6 (a) does change, however, by 
the factor t r . If r is odd, 0(<r) changes radically and only for even r 
does it stay the same. One should, therefore, restrict our map to 
rotations. Since &(F) C + (V) the group fi(F) is still contained in 
the kernel 0'(V) of this restricted map. 

DEFINITION 5.5. The map (5.18) of (T(F) into A;*/** 2 is called 
the spinorial norm. The image consists of all cosets of the form 
AlAl A 2 r k* 2 with an even number r of terms. The "spinorial 
kernel" O'(F) of this map satisfies 12(7) C 0'(V) C + (F). 

Aside from the map 6 we introduce the canonical map / : 0(V) > 
0(F)/Q(F) and state the main property of / to be used. 

THEOREM 5.12. If <r = T AI T A , r Ar e 0(V), then f(a) depends 
only on the set {A\ , Al , , A 2 r \ and neither on the particular vectors 
A i , A 2 , , A r nor on their order. 

Proof: Suppose A? = B\ . There exists a X, e 0(V) such that 
\iA t = B< . Then \ % T At \~ l = r Bt . Since the image of / is a com- 
mutative group we have/ (r.) = /(X.7M.X7 1 ) = /(X)/(r At )/(X i )~ 1 = 
/(T A< ) hence /(cr) = /(TL r *<) where any arrangement of the product 
of the r Bl gives the same result. 

THEOREM 5.13. If V = UW, <r = rp where r c + (U) 
and p e + (W\ then 8(0) = 8(r)8(p). If we identify r e <T (C7) with 
T J_ 1 w c + (F), /Aen 0(r) computedin U is the same as 0(r) computed in 
V. We may say that the spinorial norm is invariant under orthogonal 
imbedding. We have 0'(U] = 0+(U) H O'(F). 

Proof: Since a- = (r J_ l w r)(l t/ p) it is enough to show 0(r) = 
0(r JL W). Let r = T^TA, T Ar be expressed as product of sym- 
metries TA< of the space U. Then r Ai = TA, J_ ITT is the symmetry 
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with respect to (A t )* of V and r J. lw = r Al T An r An . All con- 
tentions are now obvious. 

THEOREM 5.14. Suppose a = T A T B is a product of two symmetries 
only and <r e 0'(V). Then v E 0(7). 

// dim 7 = 2, then 0'(7) = 0(7) and eacfe eZemen* o/ 0(7) is a 
square of a rotation. If dim 7 = 3, Men- we have also 0'(7) = 0(7) 
and each element of 0(7) is the square of a rotation with the same axis. 

Proof: 

1) If (r = T A T B , then 0((7) = A 2 5 2 /c* 2 . If <7 e 0'(7), then A 2 5 2 
a 2 e fc* 2 . Put #! = (A 2 /a)B, then r,,, = r* and B\ = (A 2 ) 2 2 /a 2 = A 2 . 
We may, therefore, assume from the start that A 2 = B 2 . But then 
/(TATB) = /(TATA) = 1 by Theorem 5.12, hence a- e 0(7). 

2) If n = dim 7 = 2 or 3, two symmetries suffice to express any 
a e 0~*~(7). Hence 0'(7) = 0(7). The squares of rotations generate 
0(7). If n = 2, + (7) is commutative, every a e 0(7) is a square. 
If n = 3, er e 0'(7), let (A) be the axis of a. Should A be isotropic, 
then o- is a square of a rotation with axis (A) by our previous discussion 
of such rotations. If A 2 j& 0, then cr may be identified with the rotation 
which it induces on the plane (A)* and the contention follows from 
0(<r) = 1 and the case n = 2. 

THEOREM 5.15. // 7 is any space, U a non-singular subspace of 
dimension 2 or 3, o- = r _L l r/ * e 0'(7), then r is a square and a- e 0(7). 
Proof: Use 6(<r) = B(r) and the preceding theorem. 

THEOREM 5.16. Let U be a non-singular subspace of V with the 
following properties'. 

1) 0'(U) = 0(17). 

2) To every vector A e 7 with A 2 ^ we can find a vector R e U 
with A 2 = B 2 . 

Then 0'(7) = 0(7). 

Proof: Let <r = T AI T A , r Ar e 0'(7). Find B t t U such that 
#J = A 2 and put ^ = T Bl r Bt - r Bv . By Theorem 5.12, f(d) = f(<rj 
and clearly 0(<r) = 0(<ri) = 1. Each r Bi leaves C7* element-wise 
fixed, <T! = o- 2 _L If/* and flfaO = 0(0- 2 ) = 1. By assumption cr 2 e 0(C7), 
<r 2 is a product of squares; it follows that cr l is a products of squares, 
/(^j) = i y and therefore f(d) = 1, er e 0(7). 

THEOREM 5.17. // 7 contains isotropic vectors 7* 0, then 0'(V) = 
0(7). 
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Proof: Let N, M be a hyperbolic pair of V. The subspace U = 
(N, M) satisfies the conditions of Theorem 5.16: condition 1) since 
dim U = 2 and condition 2) since (aN + bM) 2 = 2db may take on 
any value in k. 

THEOREM 5.18. // V contains isotropic vectors, then the factor 
group + (V)/ti(V) is canonically isomorphic (by the spinorial map) 
to k*/k* 2 . 

Proof: The ontoness follows from the proof of the preceding 
theorem, since the squares of vectors of (N, M) range over k. 

If V is any space, we can always imbed it in a larger space V for 
which 0'(7) = ft(F). This can be done in many ways, by setting, 
for instance, V = V _L P where P is a hyperbolic plane. We could 
also set V = V J_ L where L is a suitable line L = (B): for some 
A e 7, A 2 7* prescribe B 2 = -A 2 , then the vector B + A will 
be isotropic, 0'(V) = Q(V). If we write for such a space V = V JL U, 
then, by Theorem 5.13, 

0'(7) = (T( 



The geometric interpretation of 0'(V) is, therefore, the following: 
it consists of those rotations of V which fall into the commutator 
group of the "roomier" space V. 

THEOREM 5.19. l v E O f (V) if and only if V is even-dimensional 
and the discriminant G of V is a square. Should V contain isotropic 
vectors , then these are the conditions for l v tobe in &(V). 

Proof: If dim V is odd, then l v is a reflexion. Let dim V be 
even and A l , A 2 , - , A n an orthogonal basis of V. Then l v = 
TA>T A . r An , hence 0(-l F ) = A\Al A 2 -fc* 2 = (?fc* 2 . 

Up to here everything went fine. But if V does not contain iso- 
tropic vectors (is anisotropic), and if dim V > 4, then the nature 
of ft(F) is still unknown and even no conjectures for a precise de- 
scription of 12(F) are available. Dieudonn6 has constructed counter- 
examples for n = 4. 

6. The cases dim V < 4 

In these cases the structure of the Clifford algebra is much simpler 
than in the higher cases. We had seen that e J 8 = e' a = ( I) r(r ~ 1)/2 e 5 
where r is the number of elements of S. If n < 4, then r < 4; e s is 
kept fixed by the anti-automorphism J if and only if r = 0, 1, 4. 
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The elements of C*(V) which remain fixed under J will be k 
if n < 3 and C (V) = k + ke M if n = 4. Notice that f or n = 4 
the Algebra C (F) is the center of C + (V). The elements of C"(V) 
which are fixed under J are in all cases just the elements of V. 

If a t C*(V), then the norm Na = a o a is also in C*(V) and 
remains fixed under J since 

(a o a 7 ) J = a jl o a j = a o </. 

We conclude that Na e fc if n < 3 and JVa e C (F) if n = 4. For 
a, e C + (V) we have, therefore, #(<* o 0) = o o 0' o </ = 
NQ oao a J = Na o Nff. 

We know that a regular element a e C + (F) must satisfy # e &*. 
Suppose conversely that a e C*(F) and ATa = a o a j = a t k*. The 
element (I/a) </ is then the inverse a" 1 of a and we have for X t V 

X* = -(aoX o a j ). 
d 

X" will lie in C"(F) and, since (a o X o </)' = a o X o of, it is left 
fixed by /. Hence X* t V. For n < 3, Na is always in k. Should a 
have an inverse, then Na t fe* as Na o Na" 1 = 1 shows. The regular 
elements of C + (V) are, therefore, the elements with a norm 5* 
if n < 3. Should n = 4, one has to add the condition Na c k* since 
one only knows Na tC (V). The point is that X* c V is automatically 
satisfied. 

We had mapped the group D(V)/k* (Theorem 5.11) isomorphically 
onto + (V) by mapping the coset ak* onto the element o- a e + (V) 
given by <r a (X) = X". For the spinorial norm 0(o- a ) it follows by 
definition and formula (5.17) that 0(O = Na-k* 2 . The spinorial 
norm corresponds, therefore, to the norm map in the group D(V)/k*. 
Let us denote by D 2 (V) the group of those a e C*(V) for which 
Na c fc* a . Then we have the isomorphism 

0'(V) ~ 



Since we can change a in the coset ak* by an element of ft* we 
can assume of a that JVa = 1. Such an a is determined up to a sign 
and we can also write 

0'(V)~D Q (V)/{1} 
where D n (V) is the set of all elements of C*(V) with norm 1. 
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We discuss now the cases separately. 

I. n = 2. 

If A! , A 2 is an orthogonal basis of V, then e M o e M = A\A\ = G 
where (7 is the discriminant of 7. We find CM = e^ . The algebra 
C + (F) coincides with C (F) = fc + fee AT and thus is commutative. 
For a = a + be M we get a = a 6e M and 



= a 2 



If Na 76 0, then a is regular and, consequently, a = A o B where 
A and B are non-isotropic vectors of V. Then Na = A 2 B 2 , A' 
(1/A 2 )A, B- 1 



As elements of C + (V) the products B o X and B o A commute 
and we may write 

X a = ^-(A oB o A oB oX) =-oX 

Na Na 

where a 2 stands, of course, for a o . Hence 

Y 2 

V ^ V **- 2 

A o A = VJT--O: 

A'a 

(A' 2 is unambiguous and in k). If we apply ,7, 



hence 



The symbol 8(0) means ft + /3 J which is an element of k. The 
final result is 

1 / 2 \ 

- Y 2 i q( a I 

A ~ O\ - T j. 



The geometric! meaning of this formula becomes clear if we con- 
sider the case of a euclidean plane when k = R, the field of real 
numbers, G = I and when C*(V) is isomorphic'to the field of complex 
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numbers. If 8 is the angle of the complex number a, then a 2 /Na = e 2 " 
and $S(a 2 /Na) = cos 26 . Our formula means then that 20 is the 
angle of the rotation X X". 

Leaving the trivial case of a hyperbolic plane to the reader we 
can generalize this result. If V is not a hyperbolic plane, then G 
is not a square and C*(V) is isomorphic to the quadratic extension 



square and 
V^G) = K. 



field k(VG) 
We have 



where D is the group of elements of K with norm 1. If <r a e + 2 (V), 
then XX* /X 2 does not depend on X, its value is %S(a 2 /Na), the 
analogue of the cosine of the angle of rotation. 

II. n = 3. 

If A l , A 2 , A 3 is an orthogonal basis of F, then C + (V) is generated 
by 1, fi , i 2 , fa where t\ = A 2 o A 3 , z 2 = A 3 o A, , i 3 = A l o A 2 . 

The table of multiplication is: 

(fi f 2 ) = -fe ofO = -A$i 3 , i\ = -^2^3 , 

(i 2 Oi 3 ) = -(t, 0{ 2 ) = -A?t\ , Z*2 = -As A? , 

(fa <>fi) = -(fi O fs) = -Ajf 2 , fj = -A?^2 

An algebra of this form is called a generalized quaternion algebra. 

The map J is very simple: 1 is kept fixed and each i, changes the 

sign. Thus, if a = x + x^ + X 2 i 2 + 



Na = (z 

= xl + AlAlxl + A 3 2 A^ 2 2 + 



We merely repeat: D(V) are the quaternions with norm 5* 0, 
D 2 (F) those whose norm is in k* 2 and D (V) those with norm 1. Then 



^ 7>o(F)/{=bl) . 

A special case is of interest. Suppose V contains isotropic vectors. 
Instead of a hyperbolic pair N, M we use A t = N + %M, A a 
N \M and A 3 , a vector orthogonal to AI and A 2 . Since 

A? =1, Al = -1, A 3 2 = aefc* 
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we have 



tf 



(i 2 o 3 ) = (i 3 of a ) = i t , i\ = -a, 
(f 3 o i t ) = -(t\ o i 3 ) = i 2 , fj = +1. 

It is easy to find 2-by-2 matrices which multiply according to 
the following rules: 

t /I 0\ . /O o\ /O a\ /I 0\ 

1 = , I = , t 2 = , 1 3 = 

\0 I/ \1 O/ \-l O/ \0 -I/ 

They span the algebra of all 2-by-2 matrices over k and this 
algebra is, therefore, isomorphic to (T(7). The norm becomes a 
homomorphism of the matrices into k which is of degree 2 in the 
elements of the matrices and is merely the determinant. We see 
therefore 

OJ(V) a* PGL 2 (k), Q,(7) ~ PSL 2 (k). 

THEOREM 5.20. // dim V = 3 and if V contains isotropic vectors, 
then Q 3 (F) c^ PSL 2 (k), a simple group unless k = F 3 (the charac- 
teristic is 7* 2). 

III. n = 4. 

We denote the center C (F) = k + ke M of C + (V) by K, the 
elements of K which have an inverse by K * and their squares by K* 2 . 
J leaves every element of K fixed and we have, consequently, 
Na = a 2 for atK. An element a e K (a = a + be M ) will be regular 
if a 2 e k* and this implies (a 2 + b 2 e 2 M ) + 2a&0Af e fc*; either a or 6 
must be 0, the coset a-fc* is either fc* or e M -k*. The corresponding 
rotation <r a (X) = X a is either identity or the one induced by e M = 
AI o A 2 o A 3 o ^4 4 : 



The regular elements of K correspond, therefore, to the rotations 

dzlr- 

Denote by P the group of all elements of C*(V) which have an 
inverse. We compose the map 0\(V) > D(V)/k* with the map 
D(V)/k* > P/K* where the coset ak* is mapped onto the coset 
aK*. The coset ak* will lie in the kernel if a e K*', since a must be 
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regular, this corresponds to the rotations dbl F . We obtain the into 
map 

0\ (V) -> P/K* whoso kernel is 1 F 
and, by restriction, 

Oi (F) -> P/K* whose kernel is the center of Oi (V). 

The image aK* of an element <r a e 0\(V) is a coset where 
N(aK*) C fc*K* 2 . If, conversely, aK* has this property, then 
Not = a/3 2 where a e fc*, E If*. The generator a of the coset aK* 
may be changed by the factor /5 and we can achieve Na e k* which 
shows our coset to be the image of cr a . Should <r a E 0{(F), then 
Na E fc* 2 , hence N(aK*) = K**\ for a coset with this property one 
can achieve Na = 1 so that such a coset is indeed the image of 
some a a E 0[(V). 

If we factor out the kernels of the maps we get isomorphisms into: 

PO\-+P/K* and PO(-*P/K* 

where the images of P0\ are the cosets aK* whose norms are in 
k*K* 2 and those of P0't have norms in K* 2 . 

If <r a E 0\ , then a a and o- have the same image aK*. Their 
spinorial norms 0(O and 0(-~O are determined by the norms of 
the coset aK*:Na-K* 2 ; indeed aK* = ae M K* and Na-k* 2 , 
N(ae M )'k* 2 are the spinorial norms of <r a and <j a . 

Now we take a look at C + (V). We need only t\ = ^.20.43, t 2 = 
AZ o AI , i z = AI o A 2 which generate the quaternion algebra of 
the three-dimensional subspace W Q = (A l , A 2 , A 3 ) of V. With 
this quaternion algebra C + (W Q ) and e M A l o A 2 o /1 3 o ^. 4 , an 
element of the center C (F), we can get the missing products contain- 
ing A* . Indeed, (A 2 o A 3 ) o e M = i v o e M will be a scalar multiple of 
A i o At . This means now that 



Another way of interpreting this fact is to say that C + (V) is the 
quaternion algebra of W but with coefficients from C (V) instead 
offc. 

To get more information we must consider two possibilities. 

a) G is not a square in fc. 

We remark that e M o e M = A*AlAlAl = G so that e M can be 
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regarded as \/G and K as the quadratic extension field 
of k. K* consists now of the non-zero elements of K. If we call W 
the space obtained from W Q by enlarging k to X", then the quaternion 
algebra C+(W) may be identified with C + (V). The elements of the 
group P are merely D(W) and the elements whose norm is in K* 2 
form D 2 (W). Since D(W)/K* is isomorphic to 0\(W) and D 2 (W)/K* 
to ft 3 (TF), we obtain an isomorphism into 



The element <r ft of 0\(V) and 0-* are mapped onto the corre- 
sponding rotation X a e + 3 (W). The spinorial norms of <r a and o- a 
are 0(<r) = Na-k* 2 respectively 0( O = 0(<r. M ) = Na-G-k* 2 . 
The image X tt is a rotation whose spinorial norm 0(X) = Na-K* 2 
lies in fc*-Jff* 2 and 0(X a ) = 0(o-)-K* 2 gives the interrelation. As 
for POi(V), we obtain the isomorphism 

Oi(V) = P0i(7) ^ g(WO- 

Thus the orthogonal (projective) four-dimensional groups are 
isomorphic to certain three-dimensional ones. Unfortunately one 
does not get PQ 4 (F). 

Again an important special case: 

Let V contain isotropic vectors. V is certainly not hyperbolic 
since G = 1 is then a square. On the other hand, if G is a square and 
V contains the hyperbolic pair N, M, we could set V = {N, M) _L U. 
The plane U has discriminant G since that of (AT, M) is 1. Let 
Aj B be an orthogonal basis of U. We can solve the equation 
(xA + yB) 2 = x 2 A 2 + y 2 B 2 = in a non-trivial way since -B 2 /A 2 = 
G/(A 2 ) 2 is a square. This shows that V is hyperbolic if G is a square. 
Theorem 5.20 now implies 

THEOREM 5.21. Let dim V = 4 and the index of V be I. This 
means that there are isotropic vectors and that G is not a square. Then 

0((V) = Q*(V)~PSL 2 (k(VG, 

a simple group in all cases. 

b) G = c 2 is a square in fc. 

We may change A 4 by a factor and achieve G = 1. Then e 2 M = 1. 
Let W = (A l , A 2 , Az) (just over k as field) be the subspace of 
V orthogonal to A 4 and C + (W ) its quaternion algebra. Then C + (V) = 
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C+(W ) + C + (Tfo)* . Put fi t = J(l + e M ), u 2 = J(l - e M ), then 
we have 1 = w t + u 2 , e M = ^ t w 2 , ft? = u\ , w 2 = #2 , w,w 2 = 0- 
We may write'now (since K = k + ke Ar ) 



Let a C + (F), a = put! + yu 2 with j8, 7 e C f (TT ). The element a 
has an inverse if and only if ft and 7 have. The antiautomorphism 
/ is identity on K and leaves, therefore, u and w 2 fixed. We find 



Na = tfjS-ti! + #7-^2 = KW + #7) + J(#0 - Ny)e M . 
Na E /c means Np = Ny, and then Na = Nfi = Ny. 
p = D(W )u l + D(W )u 2 , K* 



Hence the factor group P//f * is described by the direct product of 
isomorphic groups, 

D(WJ/k* X Wo)/**, 

and an element in P/K* by a pair (pk*, yk*) of cosets, aK* > 

(0fc*,7/c*). 

Each D(W )/k* ~ Ol(W Q ), the coset 0fc* being mapped onto an 
element \ ft e 0* 9 (W ) such that 0(X^) = Np-k**. Wo have, therefore, 
an isomorphism into: 

PO\(V) -* + ,(TT ) X 



and the images (X/j, X 7 ) of the element o- a c 0"i(F) satisfy 
B(\ y ) = 0((r a ). By restriction 

PO((V) ~ Q 3 (TT ) X fl 3 (TF ) 

and PO^F) is thus a direct product. 

We look again at the case in which V contains isotropic vectors; 
we may assume that W does also contain isotropic vectors and know 
already that V is hyperbolic. 

THEOREM 5.22. //dim V = 4 and if V is hyperbolic, then PQ 4 (F) = 
POi(V) ^ PSL 2 (k) X PSL 2 (k). 

Another interesting case is that of a euclidean space of dimension 
4 which we avoided in 2. 
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THEOREM 5.23. // V is euclidean and dim V = 4, then PO\(V) = 
PQ 4 (F) = 0\(W Q ) X 0* 3 (W ), a direct product of two simple groups 
since W Q is euclidean. 

In relativity one studies a four-dimensional space V over the 
field R of real numbers with an orthogonal geometry based on the 
quadratic form / = xl + xl + xl xl . Since isotropic vectors 
exist, the group 0\/$l is isomorphic to R*/R* 2 which is of order 2. 
The discriminant is 1, not a square in R. Hence P0 4 = 12 4 ~ 
PSL 2 (R(i)), the projective unimodular group of two dimensions 
over the field of complex numbers; 12 4 is, therefore, a simple group 
which is called the Lorentz group. The reader may work out for 
himself that the Lorentz group consists of those rotations of V which 
move the vectors with X 2 < but :r 4 > into vectors of the same 
type, in other words, rotations which do not interchange past and 
future. 

7. The structure of the group Q (V) 

In 6 we have elucidated the structure of the orthogonal groups 
for dimensions < 4; we shall now investigate the higher dimensional 
cases. It will be convenient to adopt the following terminology: 
If U is a non-singular subspace of V and if T t 0(C7), then we shall 
identify T with the element T _L It/* of 0(F), merely write T for this 
element (if there is no danger of a misunderstanding) and say that 
T is an isometry of the subspace C7. It is then clear what is meant 
by a plane or by a three-dimensional rotation of V. 

Can we improve on Witt's theorem? We shall answer this question 
only for finite fields. 

THEOREM 5.24. Suppose that k is a finite field and T an isometry 
of a subspace Ui of V onto some subspace f/ 2 of V. If U 2 is contained 
in a non-singular subspace W of codimension > 2, then we can extend 
T to an element p of the group 12(F). 

Proof: We can find a rotation <r of V which extends r. If p is a 
rotation of W*, then per also extends T. Since dim W* > 2 and since 
k is a finite field, the space W* contains vectors with arbitrarily 
given squares. One can find a p such that its spinorial norm 0(p) is 
equal to that of <r. Then 0(pcr) = 1 which shows per e 



COROLLARY. If k is a finite field, dim V > 4 and N, M two non- 
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zero isotropic vectors, then there exists an element X e ft(F) such 
that \N = M. 

Proof: M is contained in some hyperbolic plane. 

A similar but weaker statement is true for arbitrary fields: 

THEOREM 5.25. Let C7j and U 2 be non-singular isometric subspaces 
of V and suppose that U 2 contains isotropic lines. There i s a X e 12(7) 
such that C7 2 = XtA . 

Proof: We can find a rotation a such that U 2 = <rUi and may 
follow it by any rotation p of U 2 . Since U 2 contains isotropic lines, 
we can achieve 0(p) = 6(0). Setting X = p<r e 12(7) we have U 2 = Xf/i . 

COROLLARY. If dim V > 3, and if N, M are isotropic vectors j* 0, 
then there exists a c e fc* such that for any d e fc* an element X e 12(7) 
can be found for which \N = cd 2 M. 

Proof: Let M, M' be a hyperbolic pair. We can find a rotation a 
such that <rN = M. Let 0(<r) = ck* 2 and let p be the rotation of 
(M, M 1 ) which sends M onto cd 2 M. Then 0(p) = 0(cr) and X = p<7 
is the desired element of 12(7). 

We use these corollaries to prove 

LEMMA 5.4. Let dim V > 5, P = (A, ) a singular plane of V 
where A 2 = B 2 7* 0. T&ere ezis/s a X c 12(7) w/wcA fceeps B fixed and 
sends A into a vector C such that (A, C) is a non-singular plane. 

Proof: If (N) is the radical of P, then P = <B, N) and we have 
A = aB + $V; since A 2 == B 2 we get a = 1. We may replace 
otB by B and 0AT by # and obtain A = B + N. 

Let H = (B)* be the hyperplane orthogonal to B. The isotropic 
vector N ^ belongs to H and we can find an isotropic vector M 
in H such that NAf = A 2 . If fc is finite, there exists a X c 
such that \N = M (we can extend X to 7). We obtain 



and 

AC = (B + N)(B + M) = B 2 + NM = A 2 - A 2 = 0. 

Since C 2 = A a , the plane (A, C) is non-singular. If fc is infinite we 
can achieve \N = cd 2 M for a suitable c and any d. We have \B = J5 r 
XA = B + cd 2 M = C, A 2 = C 2 , AC - JB 2 - cd 2 A 2 . The plane 
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(A, C) is non-singular if A 2 C 2 - (AC) 2 7* or (A 2 ) 2 ^ (AC) 2 . We 
can choose d in such a way that 

AC = B* - cd 2 A 2 5* dbA 2 . 
We shall also need 

LEMMA 5.5. Let A be a non^isotropic vector of V and suppose 
a is an isometry of V which keeps every line L generated by a vector B, 
satisfying JB 2 = A 2 , fixed. We can conclude a = 1^ , if k either 
contains more than 5 elements, or if k = F 6 but n > 3, or if k = F 3 
but n > 4. 

Proof: Replacing o- by <r, if necessary, we can achieve that 
a A = A. Let H = (A)* and assume first that H contains isotropic 
lines (N). Since (A + N) 2 = A 2 , we have a(A + N) = c(A + N) 
with c = 1. The image 

aN = <r((A + N) - A) = (A + N} - A 

of N must be isotropic which implies 6 = +1 and, therefore, o-AT = JV- 
An arbitrary vector X of # is contained in some hyperbolic plane 
of H and we see <rX = X, which, together with a A = A, proves a = 1. 

If // is anisotropic, let C 5* be a vector of //. If there are at 
least six rotations of the plane P = (A, C), they will move the line 
(A) into at least three distinct lines of P which, by our assumption, 
are kept fixed by a. Recalling <rA = A, we find <rC = C. If k con- 
tains at least seven elements, this condition is satisfied for any vector 
C of H and we get again a = 1. If k = F 6 , then we may assume 
that H is anisotropic and, hence, dim H = 2. H contains two inde- 
pendent vectors C and D which satisfy C 2 = D 2 = 2A 2 and the 
planes (A, C) and (A, D) are anisotropic. Such planes have six rota- 
tions and we get again a 1. For k = F 3 , we have supposed n > 4 
and, hence, dim H > 3; H will contain isotropic vectors and our 
lemma is proved in all cases. 

We are now in a position to prove 

THEOREM 5.26. Suppose dim V > 5 and let G be a subgroup of 
0(V) which is invariant under transformation by elements of 12 (V) 
and which is not contained in the center of 0(V). Then the group G will 
contain an element a ? 1 which is the square of a three-dimensional 
rotation. 
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Proof: 

1) Select 0- e (?, a- j 1^ ; o- must move some non-isotropic line 
(A). We first contend that, for a suitable o-, the plane P = (A, a A) 
will be non-singular. Suppose, therefore, that P is singular. Since 
(a-A) 2 = A 2 we can use Lemma 5.4 and find a X e fl(F) that keeps 
a A fixed and moves A into a vector C = X(A) for which the plane 
(A, C) is non-singular. The element p = XcTW is also in G and 
we have 

p(A) = A^'V^A) = XcT'M) = C. 

2) We assume, therefore, that P = (A, crA) is a non-singular 
plane. We shall construct an element p 7* l r of G which keeps 
some non-isotropic line fixed; we may assume that <r moves every 
non-isotropic line, otherwise we could take p = a. There is a X e fi(F) 
which keeps every vector of P fixed and does not commute with <r. 
Suppose we had found such a X. Set p = \~ l <r~ l \a; it is an element 
of G and 

p(A) = A'V'AGrA) = A'V'M) = A'XA) = A. 

Since p(A) = A we have p 7* 1 r and, since X and <r do not commute, 
we have p 5* l v . To construct the desired X we distinguish two 
cases. If crP 7* P, let B be a vector of P such that <rB 4 P. We may 
write <rB = C + D, C t P, D t P* and D ^0. Since dim P* > 3, 
there is a X e Q(P*) which moves the vector D. For this X (extended 
to F) we find 

Atr(B) = A(C + D) = C + XD 5* C + Z> = <rB = crA(B); 

hence Xo- 5^ <rX. If ^P = P, then (rP* = P*, but the restriction r 
of a to P* is not in the center of 0(P*) (since or moves the non- 
isotropic lines). For a certain X e 12(P*) we have rX j* Xr, hence 
crX 5^ Xo-. 

3) Let, therefore, er 5^ it l r be an element of G which keeps a 
certain non-isotropic line (A} fixed. By Lemma 5.5 there exists a 
vector B which satisfies B 2 = A 2 such that the line (B) is moved 
by a. Set ffB = C, then (B) 7* (C). We denote by T A the symmetry 
with respect to the hyperplane (A)*. By Witt's theorem there is a 
H t 0(V) such that juA = B. Then HT A IJ.~ I = T B , hence 

A = TuT A = fJLTAfJL^TA 1 t 12(7). 
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We have <TT A <T~ I = r oA = T A and <JT B <T* = r c , hence 



and 



% -i -1 -i 

(TACT = (TTBff t <rr A (T 



P = 0*X(T"~ X~ = T C T A T A T B 



This element p lies in (?, it is 3* 1 sincQ (C) ^ (B), it belongs to 
Q(7) since it is a commutator. The radical of the plane P = (B, C) 
has at most dimension 1 and is the radical of the space P*; P* con- 
tains, therefore, a non-singular subspace W of dimension n 3. 
The element p keeps every vector of P*, hence every vector of W, 
fixed. If U is the three-dimensional space orthogonal to W, then p 
is a rotation of U. It belongs to 8(F), thus to O'(V), and, therefore, 
to O'(t7) = Q(J7). It follows that p is the square of a rotation of U. 
Our theorem is proved. 

If V is anisotropic, then p is the square of a plane rotation and 
this exhausts our present knowledge of the structure of &(F) in this 
case. 

If V contains isotropic lines one can say much more. Suppose 
the subspace U is anisotropic; then the axis of the rotation p is not 
isotropic, p is the square of a rotation of an anisotropic plane P. 
We shall prove that any anisotropic plane P can be imbedded in a 
three-dimensional non-singular subspace U 1 of V which contains 
isotropic lines; the element p may be regarded as a rotation of U' 
and we see that it is no loss of generality to assume that the original 
U contains isotropic lines. Select an isotropic vector AT j* of V 
which is not orthogonal to P; this is possible since, otherwise, all 
isotropic lines of V would be in P* and, therefore, left fixed by the 
rotations of P. We set U' = P + (AT); if U f were singular and (M) 
its radical, then (M ) would be the only isotropic line of [/', (M) = (N), 
contradicting the choice of N. 

This shows also that the generators (r A T B ) 2 of 0(7) (see Theorem 
3.23) are squares of rotations of three-dimensional subspaces U 
which contain isotropic lines. 

Let us return to our element p 9* 1 ot and suppose the field k 
contains more than three elements. The group 12(C7) is simple (Theo- 
rem 5.20), G C\ J2(C7) contains p and is an invariant subgroup of 
0(C7); this implies Q(C7) C G. The subspace U contains a hyperbolic 
plane P and G contains &(P). Let U 1 be another three-dimensional 
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subspace with isotropic lines; it contains a hyperbolic plane P'. 
By Theorem 5.25 we can find a X e 0(7) such that P' = XP. Since 
O(P') = XQ(P)A" 1 we conclude that O(P') C G. The group 0(17') 
is simple and 0([/') H G contains O(P'). It follows that 0(t/') C G. 
The group G contains all generators of 0(7) and consequently all 
of 0(7). 

If k = FZ , we use the fact that a space over a finite field contains 
proper subspaces with any prescribed geometry. Since dim V > 5, 
we can find a subspace V of dimension 4 which is of index 1 (is not 
a hyperbolic space). The subspace V' Q contains a three-dimensional 
subspace V which is isometric to U. Using Witt's theorem we see 
that U is contained in a four-dimensional space 7 which is of 
index 1. The group 0(7 ) is simple (Theorem 5.21) and 0(7 ) C\ G 
contains p. Hence 0(7 ) C G and we can conclude again that 
0(70 C G where 7i is any four-dimensional subspace which is of 
index 1. If U' is any three-dimensional subspace of 7, we can imbed 
U' in such a space 7, and deduce 12(17') C G. It follows that 12(7) C G 
and we have proved 

THEOREM 5.27. Suppose dim 7 > 5, 7 a space with isotropic 
lines, and let G be a subgroup of 0(7) which is invariant under trans- 
formation by 0(7) and which is not contained in the center of 0(7). 
Then 12(7) C G. If P12(7) is the factor group of 12(7) modulo its 
center, then P12(7) is simple. 

This fundamental theorem was first proved by L. E. Dickson in 
special cases and then by J. Dieudonn6 in full generality. Let us 
look at some examples. 

1) k = 72, the field of real numbers. If dim 7 = n > 3 and if the 
geometry is based on the quadratic form ^ x 2 ^ y 2 , then the 
geometry is determined by n and by the number r of negative terms. 
The case r = is disposed of by using the arguments in 2 and Theo- 
rem 5.23; the case r n leads to the same groups arid is also settled. 
In every other case isotropic lines are present and Theorems 5.20, 
5.21, 5.22, 5.27 show that P12(7) is simple unless 7 is a hyperbolic 
space of dimension 4. Since R*/R* 2 is of order 2, 12(7) is of index 
2 in + (7) if < r < n. The discriminant of 7 is (- l) r ; the element 
l v belongs to 0(7) if and only if both n and r are even. 

2) k = F q , a field with q elements. The group k*/k* 2 is again of 
order 2 so that 0(7) is of index 2 in + (7), even if 7 is an anisotropic 
plane (since the squares of the vectors 7* of 7 range over k*). 
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The order of fl(7) is, therefore, half the order of + (V). To get the 
order of Ptt(F) one would have to divide by 2 again whenever the 
discriminant of V is a square and n is even. For an even n we had 
attached a sign e = 1 to the two possible geometries. If = +1, 
then the discriminant is ( l) n/2 ; if e = 1, then it is ( l) n/2 
where g is a non-square of F Q . The reader can work out for himself 
that the discriminant is a square if and only if q n/2 e is divisible 
by 4. The order of PJ2 n (C7) is, therefore, given by 

(n-i)/4 B j-j ^2.- __ ^ fornodd, 
^ t-i 



nn-n _ f . , _ 1 

tt ,! 

where d is the greatest common divisor of 4 and </ n/2 . 

In 1 we determined the order of PSp n (F a ); if # is odd, then the d 
appearing in this order is 2. Thus the orders of PSp 2m (F q ) and of 
P^ 2m+l (F a ) are equal. It has been shown that PSp 4 (F g ) and PQ 5 (F q ) 
are isomorphic groups but that the groups PSp 2m (F q ) and Ptt 2m+l (F Q ) 
are not isomorphic if m > 3. One has, therefore, infinitely many 
pairs of simple, non-isomorphic finite groups with the same order. 
These pairs and the pair with the order 20160 which we have dis- 
cussed in Chapter IV are the only pairs known to have this property; 
it would, of course, be an extremely hard problem of group theory 
to prove that there are really no others. 

Should V be anisotropic, then the structure of 12(F) is known 
only for special fields k. M. Kneser and J. Dieudonn6 have proved 
that Pfi(F) is always simple if dim V > 5 and if k is an algebraic 
number field. This is due to the fact that V can not be elliptic for a 
valuation of k if k is a number field and if dim V > 5. It is of course 
conceivable that for arbitrary fields a similar result can be proved. 
One might have to assume that V is not elliptic for any of the valua- 
tions of k and possibly dim V > 5. 

EXERCISE. Let us call a group G strongly simple if one can find 
an integer N with the following property: Given any element a j 1 
of G] every element X of G is a product of at most N transforms 
To-* 1 ?-" 1 of <r* l (T e (?). Show that, apart from trivial cases, the groups 
PSL n (k), PSp n (k) and PQ n (if V contains isotropic lines) are strongly 
simple provided that k is commutative. Show that 0\(V) is not 
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strongly simple if V is a euclidean space. It is doubtful whether 
PSL n (k) is strongly simple for all non-commutative fields. What is 
the trouble? A reader acquainted with topological groups may 
prove that an infinite compact group can not be strongly simple. 
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To Chapter I: A deficiency in elementary algebra can be remedied by a little 
reading in any book on modern algebra, for instance [9]. The duality theory of 
vector spaces as well as of abelian groups (4 and 6) can be generalized if one 
introduces topological notions. See for instance [7], Chapter II, and [8]. 

To Chapter II: For projective geometry [1] may be consulted. I suggest, for the 
most fascinating developments in projective geometry, a study of [3]; read [5], 
Chapter III, for a survey. We have completely left aside the interesting problem 
of non-Desarguian planes. The reader will get an idea of these problems if he looks 
at some of the articles in [9]. He will find there some bibliography. 

To Chapters III-V: We have mentioned the important case of a unitary geom- 
etry and have only casually introduced orthogonal geometry if the characteristic 
is 2. The best way to get acquainted with these topics is again to read [5]; in [2] 
and [6] more details on orthogonal groups will be found. 
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